About me

Hi, I’m Chenyi Weng — a curious problem solver and data storyteller currently pursuing my Master’s in Spatial Data Science at the University of Southern California (expected Dec 2025).

My journey started with a BBA in International Business and a minor in Information Management, where I built a strong foundation in global trade, finance, and data management. Along the way, I realized that the real power lies in how we transform raw data into insights that influence decisions. That realization led me to combine my business background with advanced technical skills, bridging the gap between strategic thinking and data-driven execution.

At USC, I’ve sharpened my expertise in machine learning, scalable systems, and interactive data visualization. From building classification pipelines that achieve 95%+ accuracy to designing Spark-powered data extraction frameworks that handle millions of records, I enjoy translating complex data challenges into scalable, impactful solutions. My projects range from deep learning for image/audio analysis to interactive dashboards that empower decision-making — all rooted in a passion for using technology to solve real-world problems.

Technically, I’m fluent in Python (Flask, FastAPI, scikit-learn, TensorFlow, Keras), JavaScript (React, Node.js), SQL/NoSQL, and cloud computing with AWS/Docker, and I thrive at the intersection of software engineering and data science.

Beyond the code, what drives me is impact. I aspire to contribute to teams where I can turn messy, large-scale data into actionable insights and build applications that scale, especially in domains like urban systems, business intelligence, and AI-driven products.

If you’re looking for someone who blends business acumen with technical depth — and who genuinely enjoys the challenge of building things that matter — I’d love to connect.

What i'm doing

  • design icon

    Full-Stack & Cloud Development

    Experience with React, Node.js, PostgreSQL, and CI/CD; skilled in building interactive dashboards and APIs.

  • Web development icon

    Machine Learning & AI Development

    Built ML pipelines (Logistic Regression, Random Forest, SVM, CNNs) with >95% accuracy on structured, image, and audio datasets.

  • mobile app icon

    Data Visualization & Analytics

    Designed dashboards with Plotly/Dash for million-scale datasets with filtering and drill-down features.

  • camera icon

    Scalable Data Systems

    Developed Python + Spark pipeline for large-scale data extraction and cleaning, achieving >90% data accuracy.

Resume

Education

  1. University of Southern California — M.S. in Spatial Data Science (STEM)

    Los Angeles, CA · Jan 2024 — Dec 2025 · GPA: 3.71/4.0

    Coursework: DSCI552 Machine Learning · DSCI550 Data Science at Scale · DSCI510 Programming for Data Science · DSCI549 Computational Thinking.
    Focus: Scalable ML pipelines (Python/Spark), geospatial analytics, and interactive data visualization.

  2. National Taipei University of Business — BBA, International Business (Minor: Information Management)

    Taipei, Taiwan · Sep 2018 — Jun 2022

    Coursework: Data Structures, OOP, Operating Systems, Computer Security.
    Focus: Python for data analysis/ML, database management, and data-driven business strategy.

Experience

  1. Graduate Research & Projects — USC Viterbi School of Engineering

    Los Angeles, CA · Jan 2025 — Aug 2025

    • Built end-to-end ML pipelines (Logistic Regression, SVM, Random Forest, CNNs) and achieved >95% accuracy across structured, image, and audio datasets.
    • Developed Python + Spark ETL to extract/clean large-scale unstructured data with >90% clean-data accuracy; enabled reproducible batch processing.
    • Designed interactive dashboards (Plotly/Dash) for million-scale exploration with filtering and drill-down, improving analysis speed and decision support.

  2. Data Analyst — OMNITREE LTD.

    Taipei, Taiwan · Jun 2021 — Sep 2021

    • Automated feature extraction and SEO reporting with Python; accelerated workflow and improved target keyword rankings and page-view growth.

  3. Data Analytics Intern — Linfair Records Ltd.

    Taipei, Taiwan · May 2019 — Aug 2019

    • Cleaned and validated large copyright datasets (SQL) and produced analytics to support intellectual-property enforcement and reporting accuracy.

My skills

  • Python (NumPy · Pandas · scikit-learn · TensorFlow · Keras)
    Proficient
  • Java / Go / JavaScript · TypeScript
    Advanced
  • React · Node.js · HTML/CSS
    Intermediate
  • Machine Learning / Deep Learning
    Proficient
  • Big Data & Pipelines (Spark · ETL · Parallel)
    Advanced
  • SQL & Data Viz (PostgreSQL · Plotly/Dash · Matplotlib)
    Advanced
  • Git · CI/CD · REST APIs
    Intermediate
  • Cloud (AWS – basic)
    Intermediate
  • GIS & Spatial Analysis (ArcGIS Pro)
    Supplementary

Coursework

Vlog

Contact