Data Scientist

I turn messy data into models people can trust.

I'm a junior data scientist with a background in business informatics. I work end to end, from hand-labeling data to deploying inference, and I care about results that hold up under scrutiny.

Ali Akil

About

I care about the part of machine learning that's easy to skip: checking whether a model is genuinely better before I trust the number.

I came to data science from business informatics. I like the whole loop: framing the question, building a dataset when none exists yet, then testing the result until it holds up.

My main project, OIRseg, took a segmentation model from a few hundred hand-drawn masks to a validated web app that reaches a Dice score of 0.916 on its primary class. I want more work like that, where careful labeling and honest evaluation lead to something that actually ships.

Toolbox

Languages & Data

  • SQL
  • NumPy
  • Python
  • Pandas
  • Polars
  • DuckDB

Machine Learning

  • SciPy
  • CatBoost
  • statsmodels
  • scikit-learn

Deep Learning & CV

  • PyTorch
  • ImageJ / Fiji
  • TensorFlow / Keras
  • segmentation-models-pytorch

LLM, RAG & Agents

  • RAG
  • Chunking
  • LangChain
  • Reranking
  • Embeddings
  • Fine-tuning
  • Vector databases
  • Multi-agent orchestration

Serving & Apps

  • FastAPI
  • Streamlit
  • Hugging Face Spaces

Cloud & Data Eng

  • Spark
  • MySQL
  • Databricks
  • Azure (Synapse, ADLS)
  • GCP (BigQuery, Cloud Run)

BI & Viz

  • Plotly
  • Tableau
  • Power BI
  • Looker Studio

MLOps & Quality

  • ruff
  • Docker
  • pytest
  • pre-commit
  • Git / GitHub Actions
  • Evaluation & observability

Let's work together.

Open to data science roles and collaborations. The fastest way to reach me is by email.