👨‍💻
Try hard
  • HUTECH - Ho Chi Minh City University of Technology
  • Ho Chi Minh City

HTLinh0604/README.md

Hi 👋, I'm Huỳnh Thái Linh!

About me 🧑

  • 🌱 I’m currently studying Data Science at Ho Chi Minh City University of Technology.
  • 📊 Actively pursuing knowledge and proficiency in data analysis, Machine Learning, and Deep Learning.
  • 🎯 My goal is to master these core areas.
  • ⚡ Fun fact: I love calisthenics workouts.
  • 📫 You can reach me at: [email protected]

Connect with me:

đoàn-trí-hùng-29a077252 oantrihung.197429 doantri.hung


Languages and Tools:

Python   R   LaTeX   NumPy   Pandas   Scikit-learn   Matplotlib   TensorFlow   Flask   GitHub


📈 GitHub Stats

Huỳnh Thái Linh's GitHub Stats Top Languages

Pinned

  1. invoice_ai_automation Public

    This project transforms messy invoice images into a structured, searchable knowledge base. The pipeline automatically extracts text with Tesseract, uses Google Gemini to parse fields (vendor, total…

    Python 2

  2. Programming_Project_Clustering Public

    An unsupervised clustering analysis of over 57,000 GitHub projects using their README.md text. This study compares traditional keyword methods (TF-IDF) against modern semantic embeddings (Sentence-…

    Jupyter Notebook 2 1

  3. topic_modeling_github_readmes Public

    This project presents a comprehensive comparison of classical, embedding-based, and hierarchical (hyperbolic) topic modeling approaches on over 57K GitHub README files, highlighting the proposed G…

    Jupyter Notebook 2

  4. repo_topic_classification Public

    This project automatically classifies GitHub repositories using README.md text. It combines classical ML models with fine-tuned Transformers (Mistral-7B + PEFT/LoRA) on a large dataset of 50+ IT to…

    Jupyter Notebook 2

  5. github_dev_network_clustering Public

    This research focuses on group detection and collaboration analysis among GitHub developers. It builds a commit-based collaboration network, where nodes represent developers and edges indicate join…

    Jupyter Notebook 1 1

  6. Invoice-data-extraction Public

    This project demonstrates a classic OCR pipeline: a Flask app takes an image, applies an OpenCV preprocessing pipeline, and uses Tesseract OCR to digitize Vietnamese invoices (Bách Hóa Xanh).

    Python 2 1
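
As a small illustration of the TF-IDF keyword baseline mentioned for Programming_Project_Clustering above, the sketch below weights terms in a few toy README strings and compares documents by cosine similarity. It is not taken from that repository: the function names (`tfidf_vectors`, `cosine`), the whitespace tokenizer, the unsmoothed `log(N / df)` formula, and the sample texts are all assumptions made for illustration. Libraries such as scikit-learn use smoothed variants of the same idea.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one {term: weight} dict per document, using tf * log(N / df)."""
    # Assumption: naive lowercase + whitespace tokenization.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical README snippets, invented for this example.
readmes = [
    "flask web app for invoice ocr",
    "flask web api for image upload",
    "topic modeling of readme text",
]
vectors = tfidf_vectors(readmes)
similarity_web = cosine(vectors[0], vectors[1])    # share "flask", "web", "for"
similarity_topic = cosine(vectors[0], vectors[2])  # share no terms
```

In this toy data the two Flask READMEs share weighted terms, so they score higher than the unrelated pair. A clustering step (for example k-means) would then group documents using vectors like these.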