Irsa Ashraf

Data Scientist + Backend Engineer

Work: Turning messy data into insights via APIs and ML.
Off-the-clock: Biking, NYC Footy, and staying on top of fashion trends.

About Me

I'm a Data Scientist and Backend Engineer working at the intersection of data science and enterprise-scale analytics. I've built production-level microservices and APIs that process and present complex data at scale, as well as ML models that deliver actionable insights.

As a self-taught programmer, I love building my technical skills through hands-on projects and real-world problem-solving.

Outside of work, you'll find me biking around NYC, playing soccer with my NYC footy team (Go FC Chaos!), trying to get into the 8am Solidcore class, and writing about data and tech on Substack.

ML/AI Specialization
Writing on Substack
Building Side Projects

Skills & Technologies

Data Science & ML

Python, PyTorch, Keras, Scikit-learn, Pandas, NumPy, ML Pipelines

Backend Engineering

Python, FastAPI, Flask, PostgreSQL, gRPC, API Design, Microservices

Cloud & DevOps

AWS, Docker, Kubernetes, CI/CD, Git, Linux

Analytics & Visualization

SQL, Tableau, Data Analysis, Statistics, A/B Testing, Jupyter

Featured Projects

AI-Powered Tennis Court Finder

🎾

LLM-powered agent that uses OpenAI's API to interpret natural-language queries about NYC tennis courts. Implements agentic RAG with a tool-calling architecture, geocoding, and real-time distance calculations, served through a Dockerized FastAPI app.

OpenAI, FastAPI, Docker, Agentic RAG
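
To give a flavor of the distance step, here is a minimal sketch of how courts could be ranked by distance from a geocoded point. It assumes great-circle (haversine) distance, which may not be what the project itself uses; the function names, court labels, and coordinates are hypothetical placeholders, not taken from the project.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius in kilometers


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    # Clamp to 1.0 to guard against floating-point overshoot before asin.
    return 2 * EARTH_RADIUS_KM * math.asin(min(1.0, math.sqrt(a)))


def nearest_courts(origin, courts, limit=3):
    """Return up to `limit` (name, distance_km) pairs, closest first.

    `origin` is a (lat, lon) tuple; `courts` maps a name to a (lat, lon) tuple.
    """
    ranked = sorted(
        ((name, haversine_km(origin[0], origin[1], lat, lon))
         for name, (lat, lon) in courts.items()),
        key=lambda pair: pair[1],
    )
    return ranked[:limit]


# Hypothetical placeholder data: illustrative coordinates only.
example_courts = {
    "court_a": (40.78, -73.96),
    "court_b": (40.70, -73.99),
    "court_c": (40.75, -73.98),
}
example_origin = (40.758, -73.985)

for name, km in nearest_courts(example_origin, example_courts, limit=2):
    print(f"{name}: {km:.2f} km")
```

In a tool-calling setup, a function like `nearest_courts` would be one of the tools the agent can invoke once the user's location has been geocoded.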

Tech & Data Writing

✍️

Publishing insights on data science, machine learning, and technology on Substack. Sharing practical experiences, technical tutorials, and thoughts on the evolving landscape of AI and analytics.

Technical Writing, Data Science, ML Insights

NYC Restaurant Analysis

🍕

Data analysis project exploring eating habits and restaurant trends across New York City neighborhoods. Analyzes patterns in cuisine types, pricing, ratings, and geographic distribution using Python data science tools.

Python, Pandas, Data Analysis, Visualization

Social Media Toxicity Classifier

💬

Multi-class text classification model that identifies toxic comments from Wikipedia and YouTube. Preprocessed and balanced the datasets, then implemented NLP techniques including Bag of Words features and CNN architectures for classification.

Python, NLP, CNN, Text Classification
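
For readers unfamiliar with Bag of Words, the sketch below shows the general idea: each comment becomes a vector of word counts over a fixed vocabulary. It is a standard-library illustration only, not the project's actual pipeline; the tokenizer, function names, and sample comments are assumptions made for this example.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def build_vocabulary(documents):
    """Map each distinct token to a column index, in alphabetical order."""
    tokens = sorted({tok for doc in documents for tok in tokenize(doc)})
    return {tok: idx for idx, tok in enumerate(tokens)}


def vectorize(text, vocabulary):
    """Return the word-count vector for `text`; unseen tokens are ignored."""
    vector = [0] * len(vocabulary)
    for tok, count in Counter(tokenize(text)).items():
        idx = vocabulary.get(tok)
        if idx is not None:
            vector[idx] = count
    return vector


# Hypothetical sample comments, for illustration only.
comments = ["you are great", "you are you"]
vocab = build_vocabulary(comments)
features = [vectorize(comment, vocab) for comment in comments]
print(vocab)
print(features)
```

Vectors like these can feed a classical classifier, while a CNN would typically work from token sequences or embeddings instead of raw counts.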

Spotify Genre Clustering

🎵

Unsupervised learning project clustering Spotify songs into genres based on audio features. Applied dimensionality reduction and clustering algorithms to discover patterns in music characteristics and genre boundaries.

Python, Scikit-learn, Clustering, Unsupervised ML

Experience

Applied Data Scientist / Backend Engineer

Citi
Sep 2023 – Present

Data Scientist

University of Chicago, Data Science Institute
Mar – Jun 2023

Data Science Intern

CyberCube
Jun – Aug 2022

Graduate Teaching Assistant - Machine Learning

University of Chicago (Prof. Christopher Clapp)
Mar – Jun 2022

Data Analyst

NexDegree (Unilever)
Sep 2020 – Jul 2021

Strategy and Data Analyst

United Nations, Executive Office of the Secretary-General
Mar – Aug 2019

Education

M.S. Computational Analysis and Public Policy

University of Chicago
2021 – 2023

B.A. Economics

University of California, Los Angeles (UCLA)
2014 – 2018

Get In Touch

I'm always open to discussing new projects, opportunities, or just having a conversation about data, ML, and technology.