Graduate Studies

DataJam

On Friday, September 27th, CSUN will launch the 3rd Annual DataJam Competition with the theme: Resilient LA!  

The City of Los Angeles was selected as an inaugural member of the 100 Resilient Cities Network in 2013. This global network, pioneered by the Rockefeller Foundation, helps member cities around the world become more resilient to the physical, social, and economic challenges of the 21st century.

In DataJam 2019, students will discover the scope of resilience based on these topics:

  1. Health & Wellbeing (social/health equity)
  2. Economy (industry, business, finance)
  3. Infrastructure - Planning & Development (urban development, land use, mapping)
  4. Infrastructure - Critical Services (transportation, water, energy, communications)
  5. Environment (biology, ecology, climate change)
  6. Disaster (prep, recovery, public safety, risk assessment)

Read the mayor's strategic plan for a Resilient LA here.

Canvas Course

Students are encouraged to enroll in the Canvas course, DataJam 2019, as an introduction to data science, to access resources, and stay informed about the competition. 

Students can self-enroll in the course by following this URL: https://canvas.csun.edu/enroll/NGX4GG. Alternatively, students can sign up at https://canvas.csun.edu/register and use the following join code: NGX4GG.

New to data science? Review Professor Wayne Smith's presentation, "Introduction to Data Science."

Student Team Information

All CSUN students interested in participating must enroll for the DataJam 2019 course on Canvas. The course provides an overview of data science with tutorials on data visualization, mapping and geospatial data, statistical analysis, and using open data. Resources are also provided on resilience topics. Datasets that must be used in competition will be available through the Canvas site immediately following the launch on September 27th. 

Students interested in competing in DataJam 2019 will form teams of at least two and no more than five CSUN students. Teams must register by Monday, October 7th using this form.

September 27th - Launch Schedule

The launch of DataJam 2019 will take place on Friday, September 27th, in the USU Lake View Terrace room. Please RSVP for each session you plan to attend.

Start TimeUSU Lakeview TerraceUSU Flintridge
11:30 am check-in / 12:00 startLunch, Welcome; Sherrie Hixon: Resilience Overview (25 mins) 
12:30 pm break-out sessionsDr. Kerry Nickols: Climate Change (50 mins)Dr. Regan Maas: Disadvantaged Community Modeling (50 mins)
1:30 pm break-out sessionsDr. Roxanne Moschetti: Resilience Fatigue (20 mins)Dr. Kunpeng Li: Big Data in Operations Management (20 mins)
2:00 pm break-out sessionsDr. David McCarty-Caplan: Teachers & Guns: Using Data to Inform Social Policy and Prevent Community Violence (50 mins)Dr. Steve Graves: Fast Food Access and Childhood Fitness in LA (50 mins)
3:00 pm break-out sessionsNairee Bedikian: Portfolium; Sherrie Hixon: Canvas course; Erika Reyes: Student team networking (50 mins)Dr. Natale Zappia: Open Garden: Utilizing Technology to Reimagine Urban Food Systems (20 mins)
4:00  program ends 

October 4th - Data Science Workshops

We are excited to welcome John Peach, Sr. Data Scientist at Amazon Alexa, as our Keynote Speaker! Don't miss this unique opportunity to hear from a leader in this exciting and transforming field!

Workshops will focus on data science topics student teams need to be successful in data analysis and related software programs. Please RSVP for each session you plan to attend.

Start TimeUSU Lakeview TerraceUSU Flintridge
10:30 am break-out sessionsDr. Andrew Ainsworth: Intro to R (50 mins)Dr. Adriano Zambom: Statistical Significance: P-values: How and When to Use Them (50 mins)
11:30 am lunch, keynote @ 12:00 pmJohn Peach, Amazon Alexa (60 mins) 
1:10 pm break-out sessionsDr. Wayne Smith: The Critical Contributions of Arts, Humanities, and Other Non-STEM Majors to an Analytics Team (50 mins)Dr. Mori Jamshidian: Using Statistical Software to De-emphasize Standardization and the Central Limit Theorem in Teaching Statistical Inference (50 mins)
2:05 pm break-out sessionsDr. Katya Mkrtchyan: Automated Quantification of Mosquitoes (20 mins)Dr. Dongling Huang: Tutorial for Azure ML Studio with Business Applications (80 mins)
2:30 pm break-out sessionsDr. Akash Gupta: Introduction to Data Analytics (50 mins)Dr. Huang (continues)
3:30 pmOpen Q&A (30 mins) 

October 18th - Competition Day

Schedule details will be announced once the competing teams are confirmed. Please check back for details.

If you plan to attend, please RSVP

Introduction to Data Science

New to data science? Review Professor Wayne Smith's presentation, "Introduction to Data Science." 

Judging Criteria

Team presentations will be evaluated using the following criteria:

Data Visualization: Teams generate presentations that balance content (subject matter) and aesthetics (use of color, typography, etc.), and cohesion (unity) and coupling (sequence) of presentation material. Teams will: 1) choose the right visual for the purpose/goal of the presentation, 2) link the visuals to exploratory analysis and descriptive statistics, and 3) demonstrate how the visuals suggest further data collection, confirmatory analysis, or predictive modeling techniques.

Data Science: Teams have appropriately stated their objectives and/or hypothesis and used appropriate datasets to explore their hypothesis. Teams have used the appropriate type of analyses to answer their research question and support their interpretation of the results. Teams demonstrate an understanding of sampling, observed variables, distribution-based confirmatory tests, and mathematical calculations.

Reproducible Research: Teams present their investigation consistent with core research integrity that allows for other individuals to: 1) understand the methodological framework and analytic environment of the researcher; 2) access the original data in an open format; 3) replicate each step in the analytical process; 4) verify results using software applications that are freely licensed and readily available; and, 5) access scripts and results in a format that are malleable and publicly accessible.

Insights: Teams generate insights into the use of data and specify: 1) acknowledgement of the limitations of provided (endogenous) data; 2) identification of alternate sources (exogenous) of data that adds value to their hypothesis; 3) appropriate acquisition of other data; and, 4) integration of new data to create information that is more descriptive, diagnostic, predictive, or prescriptive than use of the original data alone. 

Resilience: Teams demonstrate an understanding of the breadth of resilience challenges faced by the LA region. Data is used to appropriately identify a unique resilience-related challenge for the LA region, justify the hypothesis, and support a compelling argument for the team's proposed solution and how it can apply to institutions and/or government agencies.

Judges Choice: Judges will award in this category at their complete discretion. Presentation stands out and exhibits exemplary work, potentially in a category outside of the defined criteria. Judges have complete freedom to award this category to any team for any reason. Potential criteria may include approaches to research, resilience, and/or data science that are considered "novel," "trans-disciplinary," "ethical," "innovative," "strategic," or other. 

DataJam Committee

Many thanks to our DataJam 2018 Committee!

  • Andrew Ainsworth (Psychology, CARE)
  • Elizabeth Altman (Oviatt Library)
  • Meeta Banerjee (Psychology)
  • Kyle Dewey (Computer Science)
  • Helen Heinrich (Information Technology)
  • Sherrie Hixon (Research & Graduate Studies)
  • Charissa Jefferson (Oviatt Library)
  • Crist Khachikian (Research & Graduate Studies)
  • Li Liu (Computer Science)
  • Erika Reyes (Research & Graduate Studies)
  • Chris Salvano (Oviatt Library)
  • Tim Tiemann (CSUN Innovation Incubator)
  • Dongling Huang (Marketing)
  • Adriano Zambom (Mathematics)

DataJam 2019 Awards!!!

Winning team presentations for DataJam 2019 will be announced at the competition on October 18th.

  • Best Data Visualization:

  • Best Data Science:
  • Best Reproducible Research:
  • Best Insights:
  • Best Resilience:
  • Judges Choice:

Thank you to everyone who participated in this year's event!