This NSF-funded summer program teaches students the principles of data science and machine learning. Students will learn the concepts of data modeling, databases, SQL, data cleaning, data wrangling, and visualization. They will also learn basic Python programming for processing data, along with machine learning techniques, including basic concepts, training, classification, and sentiment analysis.

The program will use the open-source Texera platform to help students become familiar with these concepts even if they have a limited computing background. It includes a capstone project in which students apply what they have learned by analyzing real data (e.g., social media data) with ML-based data science techniques. The instructors and staff include professors and Ph.D. students from UCI and UCLA who are experts in data management, data science, and machine learning.

Following a successful program in 2023, we will hold Data Science for All (DS4ALL) again in 2024. More information to come.

Faculty Instructors

Dr. Chen Li, Department of Computer Science, UCI

Dr. Wei Wang, Department of Computer Science, UCLA

Program Details (Tentative)

  • Program dates: 07/08/2024 – 07/19/2024
  • Fees: Free of charge (funded by NSF).
  • Lunch: Lunch will be provided during the program.
  • Deadline to apply: 05/22/2024
  • Acceptance notification: 05/26/2024
  • Eligibility: Rising high school students in grades 9–12 and graduating 12th graders.
  • Prerequisites: Algebra II or Integrated Math II
  • Contact email: ds4all AT ics DOT uci DOT edu
  • Application portal: Not available yet; a link will be provided here soon.

Program Schedule (Tentative)

  • 7/8 – 7/12, 2024: Data science basics using Texera (UCI)
  • 7/15 – 7/19, 2024: Artificial Intelligence and Machine Learning (UCI/UCLA)