Phan Hung Thinh

Data Science & Analytics Professional

Passionate about transforming complex data into actionable insights that drive strategic decisions. With expertise in machine learning, statistical analysis, and data visualization, I help businesses unlock the power of their data to solve real-world problems.

About Me

"Nothing Changes if Nothing Changes"

With 2+ years of experience across startups and large companies, I am dedicated to leveraging data to create meaningful business value. I enjoy collaborating with diverse teams and am committed to delivering practical, data-driven solutions that support organizational growth and innovation.

Skills & Expertise

Programming Languages

Python
SQL
R

Data Analysis & Visualization

Pandas, NumPy
Power BI
Matplotlib, Seaborn

Machine Learning

Scikit-learn
TensorFlow, PyTorch
Time Series Analysis

Tools & Platforms

Databricks
Jupyter Notebook
MLflow

Education

Academic foundation in data science and analytics

Bachelor of Data Science

International University - VNU HCMC
Aug 2019 - Aug 2023

Graduated with a strong foundation in statistical analysis, machine learning, and data visualization. Completed comprehensive coursework in data mining, predictive modeling, and business intelligence with hands-on experience in real-world projects.

Key Coursework & Skills

Statistical Analysis
Machine Learning
Data Visualization
Predictive Modeling
Data Mining
Business Intelligence
Database Management
Big Data Analytics

Featured Projects

Showcasing my data science and analytics work

GMV Forecasting with a Weighted Hybrid Model (SARIMAX + Prophet)

GMV Forecasting with a Weighted Hybrid Model

Developed a weighted hybrid model combining SARIMAX and Prophet to forecast GMV, leveraging both models for improved accuracy in time series prediction.

Python SARIMAX Prophet Time Series
Multiclass Classification with CNN

Multiclass Classification with CNN

Demonstrates how architectural choices significantly impact model performance, with the simpler CNN design achieving better results.

Python TensorFlow CNN Computer Vision
Song Recommendation Engine

Song Recommendation Engine

A decision support system that recommends songs using K-means clustering on song attributes to suggest similar tracks.

Python Streamlit K-Means Recommendation System
Customer Review Analysis

Customer Review Analysis & Sentiment Prediction

Machine learning approach to uncover insights and predict customer sentiments from reviews using NLP techniques.

Python NLP Scikit-learn Sentiment Analysis

Work Experience

My professional journey

Data Analyst

Grab

Nov 2023 - Present

  • Machine Learning & Forecasting: Developed machine learning models in Databricks to enhance operational decision-making. Delivered a driver safety gear classification model (e.g., detecting missing equipment) with ~92% precision, and a GMV forecasting model with ~4% MAPE. Outputs were automated and integrated into internal data systems for business visibility.
  • Data Engineering & Automation: Developed and scheduled data pipelines in Databricks using SQL (Presto/Trino), PySpark, and Python to streamline reporting and analytics. Automated workflows to support high-volume data processes and reduce manual intervention.
  • Data Quality & Reliability: Implemented robust data integrity and source validation mechanisms, significantly improving consistency and reducing the need for manual preprocessing in downstream workflows.
  • Data Visualization & Reporting: Designed Power BI dashboards for regional performance reviews, enabling data-driven decisions across teams. Enhanced reporting efficiency by automating processes, reducing manual effort by over 70%.
  • Cross-functional Collaboration: Partnered with Finance, Operations, and Strategy teams to identify challenges and deliver impactful data solutions (ad-hoc analyses, insights, and dashboards).

Data Science Intern

Unilever

Oct - Nov 2023

  • Built machine learning models for demand sensing to improve sales forecast accuracy.
  • Engineered time series features and evaluated model performance using real-world FMCG data.

Data Science Internship

FWD Insurance

Jun - Aug 2023

  • Supported feature engineering for fraud detection model to avoid claim abuse cases.
  • Built interactive dashboards in Power BI to support performance tracking.

Data Analyst Intern

Logivan

Jun - Dec 2022

  • Built dashboards in Holistics (BI Platform) for shipment tracking and partner performance.
  • Analyzed logistics data to improve route efficiency and reduce delivery delays.

Certificates

Professional certifications and continuous learning

WorldQuant University Logo

Applied AI Lab: Deep Learning for Computer Vision

WorldQuant University
Issued: June 2025
Gained hands-on experience in data preparation, deep learning (MLPs, CNNs, transformers), and tasks such as image classification, object detection, and generative AI, while applying transfer learning, exploring AI libraries, and addressing ethical and environmental issues in AI development.
Deep Learning Machine Learning Image Classification Object Detection
Google Logo

Google Data Analytics Professional Certificate

Google (Coursera)
Issued: April 2022
Comprehensive program covering data analysis process, tools, and techniques including data cleaning, visualization, and statistical analysis.
Data Analysis SQL Python Data Visualization Data Analysis
IBM Logo

BI Foundations with SQL, ETL and Data Warehousing

IBM (Coursera)
Issued: July 2022
Gained foundational skills in Linux commands and shell scripting, SQL for data analysis, and building ETL and data pipelines using tools like Airflow and Kafka. Also developed a strong understanding of data warehouse concepts and architecture.
Extract, Transform, Load Data Warehousing Python Data Pipelines Databases

Get In Touch

Let's discuss how data can drive your business forward

Location

Ho Chi Minh City, Vietnam

Phone

+84 398 035 345

Resume

Download CV

Ready to collaborate on your next data science project?

Send Message