I help businesses and researchers make data-driven decisions by transforming raw, unstructured data into clean, validated, and insightful outputs, from exploratory analysis to dataset creation for machine learning.
My work spans:
- Business-focused data analysis
- High-quality dataset preparation
- Applied machine learning & computer vision
Languages & Libraries
- Python (Pandas, NumPy, Scikit-learn)
- SQL (PostgreSQL, MySQL)
Data Analysis & Visualization
- Exploratory Data Analysis (EDA)
- Matplotlib, Seaborn
- Excel Dashboards & Power Query
Machine Learning & CV
- YOLO (v5–v11) dataset preparation
- Manual annotation & validation
- Data quality checks & inspection pipelines
Tools
- Jupyter Notebooks
- Git & GitHub
- LabelImg
- Microsoft Excel
| Project | What I Did | Tech Used |
|---|---|---|
| YOLO Person & Vehicle Detection Dataset | Curated & manually annotated 2,000 images with 4,000+ bounding boxes. Built a clean, YOLO-ready dataset with a validation & inspection notebook. | Python, YOLO, LabelImg |
| GitHub Repo • Kaggle Dataset | | |
| LA Crime Analysis | Identified the peak crime hour (12 PM) to support resource allocation decisions. | Python, Seaborn |
| GitHub Repo | | |
| Stack Overflow Trends | Analyzed 5-year growth trends of popular programming languages. | Python, Pandas |
| GitHub Repo | | |
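
For a sense of what a peak-hour finding like the one in the LA Crime Analysis row involves, here is a minimal sketch using Pandas. It is not the project's actual code: the `peak_hour` helper and the sample timestamps are made up for illustration, and the real dataset's column names and formats may differ.

```python
import pandas as pd


def peak_hour(timestamps: pd.Series) -> int:
    """Return the hour of day (0-23) that appears most often."""
    hours = pd.to_datetime(timestamps).dt.hour
    # value_counts() tallies records per hour; idxmax() picks the busiest one.
    return int(hours.value_counts().idxmax())


# Tiny made-up sample, NOT real crime data.
sample = pd.Series([
    "2023-01-01 12:05",
    "2023-01-01 12:40",
    "2023-01-02 12:15",
    "2023-01-02 18:30",
    "2023-01-03 03:10",
])

print(peak_hour(sample))
```

On real data, the same hourly counts could be passed to a Seaborn bar plot to show the full daily pattern rather than a single number.
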
- Clean, analyze, and visualize messy datasets
- Build reproducible EDA notebooks
- Prepare high-quality datasets for ML & CV projects
- Validate annotations and data integrity
- Create portfolio-ready analytical reports
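
As a rough illustration of the annotation validation mentioned above, the sketch below checks single lines of a YOLO-format label file, where each line is `class x_center y_center width height` with coordinates normalized to the 0 to 1 range. The `validate_yolo_line` helper and the two-class setup (person, vehicle) are assumptions made for this example, not code taken from any of the listed projects.

```python
def validate_yolo_line(line: str, num_classes: int) -> list[str]:
    """Return the problems found in one YOLO label line (empty list if valid)."""
    parts = line.split()
    if len(parts) != 5:
        return [f"expected 5 fields, got {len(parts)}"]
    try:
        class_id = int(parts[0])
        x, y, w, h = (float(p) for p in parts[1:])
    except ValueError:
        return ["non-numeric field"]

    problems = []
    if not 0 <= class_id < num_classes:
        problems.append(f"class id {class_id} out of range")
    # YOLO coordinates are normalized, so every value must lie in [0, 1].
    if not all(0.0 <= v <= 1.0 for v in (x, y, w, h)):
        problems.append("coordinate outside [0, 1]")
    if w <= 0 or h <= 0:
        problems.append("non-positive box size")
    return problems


# Hypothetical label lines: one valid, three with different defects.
labels = [
    "0 0.5 0.5 0.2 0.3",
    "1 0.9 0.4 0.3",
    "2 0.5 0.5 0.1 0.1",
    "0 1.2 0.5 0.1 0.1",
]
for line in labels:
    print(line, "->", validate_yolo_line(line, num_classes=2) or "ok")
```

A fuller inspection pipeline would typically also confirm that every image has a matching label file and flag duplicate boxes, which this sketch leaves out.
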
I'm currently available for freelance data analysis & dataset preparation projects.
- LinkedIn: https://www.linkedin.com/in/haseeb-uddin-q/
- Email: [email protected]
- GitHub: https://github.com/Haseeb-U
- Kaggle: https://www.kaggle.com/haseebhsb
- Portfolio: https://haseeb-u.github.io/
"Data is just numbers until you tell its story."