Data Science for All (2025)

This NSF-funded summer program teaches students the principles of data science and machine learning. Students will learn concepts about data modeling, data cleaning, data wrangling, and visualization. Students will learn basic Python programming for processing data and learn ML techniques, including basic concepts, training, classification, and sentiment analysis.

The program will use the open-source Texera platform to help students get familiar with these concepts even if they have a limited computing background. We will include a capstone project for students to learn these skills by analyzing real data (e.g., social media data) to apply the knowledge to conduct ML-based data science. The instructors and staff include professors and Ph.D. students from UCI and UCLA who are experts in data management, data science, AI, and machine learning. A gallery showcasing the classroom and learning environment for the program is available.

Students who complete the program without any absence will receive a certificate from the program.

Faculty Instructors


Dr. Chen Li, Department of Computer Science, UCI

Dr. Wei Wang, Department of Computer Science, UCLA

PhD Student Instructors (TBD)


program Details

Program schedule (Tentative)
Date Morning (Lecture) Afternoon (Lab)
07/07/2025 Intro, Program Orientation;
Texera overview
Use Texera to create the first datasets and workflows
07/08/2025 Texera operators, importing data,
data wrangling, visualization
Use Texera to do cleaning and wrangling
07/09/2025 Texera: user‑defined functions in Python Use Texera for more‑complicated analysis with Python UDFs
07/10/2025 ML model: Classification, clustering Construct workflows to do classification and clustering
07/11/2025 ML model: neural networks (CNN, RNN, MLP, transformers) Construct workflows with UDF operators for representative NN models
  • Program time: 07/07/2025 – 07/11/2025 (One week)
  • Daily Schedule: From 9 AM to 4 PM
  • Location: UCI Campus (classroom to be decided)
  • Instruction Format: The program will consist of both lectures and lab sessions.
  • Fees: Free of charge (funded by NSF)
  • Lunch: Lunch will be provided during the program.
  • Deadline to apply: 04/25/2025 (Friday) by 11:59PM PDT
  • Acceptance notification: Before 05/16/2025 (Friday)
  • Eligibility: 9th, 10th and 11th graders (as of April 2025)
  • Prerequisites: Algebra II or Integrated Math II
  • Contact email:

Program Schedule (to be decided)