Long Hoa Chung

...telling impactful stories through data — bridging science and analytics.

About Me

Scientific Image Competition Profile

At NSW State Parliament House celebrating Science Week in 2023, showcasing a mass spectrometry image of human liver that I acquired and reconstructed from more than 20,000 data points.

My journey began in biomedical research, where I advanced lipidomics, proteomics, and mass-spec imaging, and has since expanded into AI and data analytics across finance, retail, and healthcare. Each project reflects my drive to combine science, technology, and innovation to solve real-world challenges.

Portfolio

I bring a blend of expertise spanning biomedical research and modern data analytics. From pioneering DESI-MS imaging, which combines laboratory work with large instrument-acquired datasets, to building AI-driven models for financial, retail, and healthcare datasets, my work demonstrates how rigorous science and data innovation can deliver impact across industries.

Project Experience

  • AI & Analytics (2024–2025): Optimised AI models for ethical outputs; delivered EDA, dashboards, and predictive models across finance and retail.
  • Healthcare & Biomedical Research (2019–2024): Built data pipelines; advanced lipidomics, proteomics, and DESI-MS imaging.
  • Leadership & Operations (2025): Directed polling site management during the Federal Election.

IBM Data Analyst Capstone – Stack Overflow Survey

Analysed Stack Overflow developer survey data and built IBM Cognos dashboards to reveal tech trends, emerging skills demand, and workforce implications.

Stack Overflow Report
IBM Analyst Dashboard Stack Overflow Data

Solution

Built dashboards using Python and IBM Cognos to track technology adoption, highlight emerging skills, and guide hiring, training, and investment decisions.

Survey Dashboard

Loan Default Prediction (Top 92%)

Developed ML models to predict loan defaults with financial data. Ranked in the top 92% globally in Coursera’s IBM Data Science competition.

Loan Default Top 92%
Loan Default Modelling

Solution

Applied feature engineering and classification models to identify at-risk borrowers, testing robustness on both ordinary and extreme borrower cases.

Financial Features

Exploratory Data Analysis on Retail Data

Analysed online retail transactions to uncover sales trends, customer behaviour, and product performance insights.

Retail Analysis
Exploratory Data

Solution

Delivered insights on top products, seasonal demand, and customer segments to support marketing, forecasting, and retail strategy.

Exploratory Data

ReTimeML: Retention Time Predictor

Predicted lipid retention times in LC-MS using machine learning models, improving annotation accuracy and supporting lipidomics workflows.

Retention Time Predictor
Mass Spectrometry

Solution

Built predictive models with molecular descriptors and chromatographic conditions to estimate lipid retention times, reducing manual validation.

Mass Spectrometry

Customer Churn Prediction

Predicted customer churn for streaming businesses using logistic regression, random forest, and gradient boosting models.

Churn Prediction

Solution

Applied robust scaling across 21 variables, engineered features, and compared models using cross-validation and AUC scores to select the best performer.
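The two building blocks named above, robust scaling and AUC-based comparison, can be sketched in a few lines. This is a minimal illustration using only the Python standard library, not the project's actual code (which presumably relied on a machine learning library). The helper names `robust_scale` and `roc_auc` and the toy numbers are assumptions made for the example.

```python
from statistics import median, quantiles


def robust_scale(values):
    """Centre a feature on its median and divide by its interquartile range.

    Unlike mean/standard-deviation scaling, the median and IQR are barely
    moved by a few extreme values.
    """
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    if iqr == 0:
        iqr = 1.0  # constant-like feature: only centre it
    med = median(values)
    return [(v - med) / iqr for v in values]


def roc_auc(labels, scores):
    """AUC as the chance that a random positive outscores a random negative.

    Ties count as half a win. Labels are 1 (churned) or 0 (retained).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one example of each class")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical): one outlier in the feature, four scored customers.
scaled = robust_scale([1, 2, 3, 4, 100])
auc = roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
```

In a model comparison, a score like `auc` would be computed on each held-out fold for every candidate model and averaged, and the model with the highest mean AUC would be selected.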

Churn Models

Skills

I bring a diverse technical toolkit spanning programming, analytics, and visualisation. My expertise in Python, R, SQL, and dashboarding and plotting tools (IBM Cognos, matplotlib, seaborn) allows me to translate raw data into clear insights and impactful solutions.

Skills Wordcloud

Experience

🔬 Research Scientist

PhD scientist with expertise in lipidomics, proteomics, and DESI imaging, publishing novel findings and earning multiple research awards.

📊 Data Scientist

Transitioned into data science and AI, delivering predictive analytics pipelines and business dashboards, and ranking in the top 92% in a global Coursera challenge.

🤖 AI & Leadership

Current work spans AI model evaluation (Outlier AI), financial/operational analytics, and leadership roles such as Officer-in-Charge at the Australian Electoral Commission.

Publications

My research combines big data in genomics, proteomics, and lipidomics with advanced computational and instrumental approaches. This integration has enabled novel insights into the mechanisms of metabolic disorders, cancer, and neurodegeneration, contributing to improved health and healthier ageing. A key example is my work developing neural-network methods for DIA proteomics and real-time lipidomics.

Summary

  • 5 first/co-first author papers
  • 1 review article
  • 9 collaborative papers
  • 1 book chapter

Full list: ORCID 0000-0003-1834-5747

Retention Time Prediction

Built machine learning models to predict lipid retention times, improving lipidomics workflows.

Read paper

DESI Imaging for Liver Cancer

Among the first in Australia to apply DESI-MS imaging, revealing metabolic heterogeneity in liver cancer tissue.

View publication

Type 2 Diabetes

My lipidomics methods supported PREVIEW, an international study uncovering lipid roles in diabetic conversion.

View publication

Gene Regulation Study

Revealed how Sox18 mutations impact vascular development and disease progression.

View publication

Lipidomics Review

Co-authored a review summarising lipidomics advances and emerging applications in health research.

View publication

Hobbies

🌹 Gardening

Gardening connects me to nature through flowers, fruit trees, and seasonal plants.

🍳 Cooking

Experimenting in the kitchen is one of my favourite creative outlets.

50+ Recipes Mastered
3 Cuisines Explored

🚗 Travelling

Road trips recharge my energy and inspire fresh perspectives.

15+ Destinations
5,000+ km Travelled

Contact Me

If you'd like to get in touch, please fill out the form below or reach me via social media.