San Diego, CA
The Data Dispatch of Hannah Sky Gonzalez
The Gonzalez Times
Driven by Curiosity, Built on Data

Vol. I, No. 1 Data Science · Machine Learning · Analytics
Aspiring Data Scientist Exploring Data Without Borders
Hannah Sky Gonzalez bridges the gap turning data into actionable insights and impact.

Hi, I'm Hannah Sky Gonzalez! I'm a third year Data Science and International Business double major at UC San Diego. As a double major, I've developed a passion for connecting data to real-world applications. When it comes to data science, I use my expertise to follow my curiosity, creating data science questions and taking them from a question to a solution that connects to impact. On campus, I'm involved with Women in Business, where I am a proud Latina in Tech, and work at a Data Science lab where I help inspire future engineers by teaching K-12 students. I focus my work on mentorship and giving back to my community. For me, trying new things is hard, but I enjoy putting myself in difficult situations. As someone who interned in China, I learned to adapt to my surroundings and how to live in a different culture. Overall, my experiences have prepared me for whatever comes next.

Venture outside your comfort zone. The rewards are worth it.
- Rapunzel

Outside of my studies, I'm usually traveling to new countries, grabbing a matcha, or watching basketball. I probably already heard of the latest matcha cafe in the area. I'm a huge fan of college ball and March Madness. I've always been high energy, but I'm also observant when it comes to the strategy behind the plays. I believe the best insights come from having a mix of different experiences and perspectives. Ultimately, I love to explore, observe, and learn.

Hannah Sky Gonzalez
At a Glance
LocationSan Diego, CA
DegreesB.S. Data Science &
B.A. International Business
Recent RoleData Science Analyst Intern at JPMorganChase
PronounsShe/Her/Ella
InterestsTravel, Matcha, Basketball
Featured Projects
NCAA ML
NCAA March Madness Cinderella Anomaly Detection

Developed a Logistic Regression model to predict March Madness "Cinderella" upsets, achieving a 187% F1-score improvement over the baseline through SMOTE oversampling and PCA feature reduction.

PythonScikit-learnPandas
League of Legends ML
League of Legends Position Prediction

Developed a Random Forest Classifier using Scikit-learn to predict player positions across 19,692 rows and 161 features, achieving 77.3% accuracy and improving baseline performance by 4%.

PythonScikit-learnPandas
All Projects →
Career
June 2026 – August 2026
Data Science Analyst Intern
J.P. Morgan Chase
November 2025 – Present
Lab 3.0 Fellow
HDSI · San Diego, CA
June 2025 - August 2025
Research Assistant Intern
LOZO · Shanghai, China
April 2025 - Dec 2024
Market Research Mentor & Intern
Women in Business
Full Resume →
Skills
  • Machine Learning & MLOps
  • Statistics & Feature Engineering
  • Data Analysis & Visualization
  • Big Data Processing
  • Business Intelligence
Recent Works
A selection of data science projects — prediction, analysis, and storytelling with data.
Machine Learning
NCAA March Madness Cinderella Anomaly Detection
NCAA Anomaly Detection

Developed a Logistic Regression model using Scikit-learn to predict March Madness "Cinderella" upsets, achieving a 187% F1-score improvement over the baseline and 62.5% recall through SMOTE oversampling and PCA feature reduction.

Engineered a robust machine learning pipeline in Python to prevent data leakage, ensuring strict chronological evaluation across hold-out test years through StratifiedGroupKFold cross-validation.

Deployed a serverless web application on Vercel using JavaScript and HTML/CSS to visualize model diagnostics, delivering real-time predictions with zero backend latency by executing 100% client-side ML inference.

PythonPandasScikit-learn
Predictive Analytics
Twitter Sentiment Analysis
Twitter Sentiment Analysis

Collaborated with a team to preprocess 1.6M tweets, filter 195 college-related posts, and engineer features, enabling predictive modeling that identified a 25% increase in negative sentiment during finals week.

Built and validated predictive models using t-tests and hypothesis testing in SciPy, confirming a statistically significant correlation between high-stress periods and negative sentiment.

Created interactive visualizations in Plotly and summary statistics in Pandas to communicate sentiment trends over time.

PythonPandasSciPy
Machine Learning
League of Legends Position Prediction
League of Legends Prediction

Developed a Random Forest Classifier using Scikit-learn to predict player positions across 19,692 rows and 161 features, achieving 77.3% accuracy and improving baseline performance by 4%.

Applied statistical hypothesis testing and fairness analysis (p = 0.056) to validate unbiased model performance.

Executed a full end-to-end data modeling pipeline including preprocessing, EDA, and feature scaling; utilized Matplotlib to visualize relationships between variables.

PythonPandasScikit-learnNumPy
Data Visualization
Bikewatching Interactive Data Visualization
Bikewatching Visualization

Created an interactive geospatial visualization of Bluebikes traffic in Boston/Cambridge using Mapbox GL JS for the basemap and D3.js for custom overlays, integrating bike lane data and over 260,000 trip records.

Used a D3 Square Root Scale to accurately size station markers based on total traffic volume and implemented a reactive time slider for temporal data filtering.

Mapbox GL JSD3.js
Market Research
SkyCha Product Launch Plan
SkyCha Launch Plan

Led the development of a product launch plan for SkyChaa, conducting comprehensive competitor and market analyses to identify opportunities and gaps.

Performed customer segmentation and target market profiling to tailor marketing strategies, and developed actionable recommendations to optimize product positioning, adoption, and overall launch success.

Market ResearchStrategy
Predictive Analytics
San Diego MTS Predictive Modeling
MTS Predictive Modeling

Developed predictive models for San Diego's MTS system to forecast bus wait times using historical data, improving insights into service efficiency.

Created comprehensive visualizations, including a county-wide map of all bus lines, to communicate patterns and support data-driven planning for transit operations.

Predictive ModelingVisualization
Experience
June 2026 – Present
Data Science Analyst Intern
JPMorgan Chase & Co. · Jersey City, NJ
  • Spearheading the development of an end-to-end machine learning solution to optimize legal compliance workflows, utilizing Python and predictive modeling to automate similarity detection across legal documents for faster referencing.
  • Collaborating with cross-functional agile teams to align technical deliverables with business objectives, driving the enhancement of machine learning models that support firm-wide regulatory compliance.
Nov. 2025 – Present
Lab 3.0 Fellow
Halıcıoğlu Data Science Institute · San Diego, CA
  • Led instructional workshops for K-12 STEM outreach by quickly learning and teaching Arduino, circuit design, and introductory data science concepts, inspiring the future generation of engineers and data scientists.
  • Expanded program outreach by gathering a database of 200+ school administrators and PTA contacts through targeted online research, increasing the lab's connections for opportunities and student attendance.
June 2025 - Aug. 2025
Research Intern, Product Management
LOZO · Shanghai, China
  • Improved report-saving efficiency by 50% for 100+ users by conducting usability testing on an AI-powered educational web platform.
  • Reduced average task completion time by developing a data-driven UX persona from behavioral data and integrating insights into core feature design updates.
Apr. 2025 - Jun. 2025
Market Research Mentor
Women in Business, UC San Diego · La Jolla, CA
  • Delivered a detailed competitor and UX analysis on Spotify's AI personalization by mentoring a 5-student team through qualitative market research.
  • Presented actionable recommendations on AI modeling to 100+ students by synthesizing research findings and coaching the team on data storytelling.
Oct. 2024 - Dec. 2024
Market Research Intern
Women in Business, UC San Diego · La Jolla, CA
  • Identified feature gaps and growth opportunities by conducting SWOT and competitor analyses of Clarins AI's chatbot with a 5-member team.
  • Guided marketing strategy by designing surveys, collecting 20+ data points, and analyzing responses in Google Sheets to create visual consumer preference dashboards.
Education
2023 – 2027
B.S. Data Science & B.A. International Business
University of California, San Diego

Double major with coursework in data science, machine learning, business strategy, accounting, and international business. Active in Data Science Society, Women in Business, and Women in Computing.

Full Resume View PDF ↗
Resume Preview
Skills
Machine Learning & MLOps
Statistics & Feature Engineering
Data Analysis & Visualization
Big Data Processing
Business Intelligence
Technical Tools & Frameworks
PandasNumPyScikit-learnSciPy MatplotlibPyTorchOpenCVApache Spark & HadoopDockerKubernetes AWSDaskGitGithub PostgreSQLOracleDBMySQLStatsmodels
Technical Languages
PythonSQLRHTML/CSSJavaScriptJava
Languages
EnglishNative
SpanishFluent
Activities
Women in BusinessMember & Mentor · UC San Diego
Women in ComputingMember · UC San Diego
Data Science SocietyMember · UC San Diego
Dispatches & Field Notes
First-hand accounts, reflections, and analyses on my internships, projects, and adventures.
Internship Insights
Inside the JPMorgan Chase Data Science Desk
By Hannah Sky Gonzalez · June 2026

A look into the daily workflow of a Data Science Analyst Intern in Jersey City, building machine learning models for legal compliance and collaborating with agile teams to optimize critical financial systems.

Read Full Story →
Global Perspectives
UX Research in Shanghai: The LOZO Experience
By Hannah Sky Gonzalez · August 2025

From the streets of Shanghai to the conference room, I spent the summer conducting UX research and building behavioral personas for an AI educational web platform. Here's what I learned about cross-cultural product management.

Read Full Story →
Get In Touch
For collaborations, project inquiries, or conversations about data science and global business.
Send a Message
Direct Links
Instagram@hannahsky.g
My Friends & Network
Andy(Di Xuan) WangSWE at Microsoft, CS at UCLA
Ameli Rodriguez GarciaFuture Attorney, CogSci at UCSD
Janie ChanFuture Data Scientist, DS at UCSD