Synthetic data project framework

Open
Chromatic Data
Mississauga, Ontario, Canada
He / They
Founder & CEO
(2)
5
Project
Academic experience or paid work
60 hours of work total
Learner
Anywhere
Advanced level

Project scope

Categories
Data analysis Data modelling Machine learning Artificial intelligence Data science
Skills
statistical programming statistical inference multivariate statistics nonparametric statistics data science ai/ml inference machine learning algorithms
Details

Our company is interested in creating frameworks / templates for our pilot projects with social impact clients. Impact of this work is to maintain efficient operational execution in how we structure our client work.


We would like to collaborate with students to provide cohesive, appropriate details relative to the pilot project scaffolding from our Data Scientists. Students will write technical statistical documentation on methodologies and write code (likely in Python, using Jupyter and/or Marimo notebook) so we can achieve a fulsome pre-pilot understanding of the workflow involved.


This will involve several different steps for the students, including:

  • Working with Data Scientist guidance to write code that assesses initial data type, structure, etc, especially on determining appropriate analytical tooling for client data use case categories
  • Expanding on baseline Data Scientist framework document for synthetic data 'utility' metrics, i.e. how useful to a given statistical model the synthetic data is
  • Completing all above in analytical objectives including but not limited to dimension reduction, relationship analysis, and clustering
Deliverables

By the end of the project, students should demonstrate:

  • Understanding of key statistical modeling processes particularly as across different types and structures of social impact data
  • Development of core applied math and computer science skills in statistics and programming AI/ML applications

Final deliverables should include:

  • Source materials such as the data platform's back-end code (automated tooling by use case), if applicable, and accompanying technical documentation (data assessments, utility metrics) guiding our future pilot project work
Mentorship
Domain expertise and knowledge

Providing specialized, in-depth knowledge and general industry insights for a comprehensive understanding.

Tools and/or resources

Providing access to necessary tools, software, and resources required for project completion.

Regular meetings

Scheduled check-ins to discuss progress, address challenges, and provide feedback.

Supported causes

The global challenges this project addresses, aligning with the United Nations Sustainable Development Goals (SDGs). Learn more about all 17 SDGs here.

Industry, innovation and infrastructure

About the company

Company
Mississauga, Ontario, Canada
0 - 1 employees
Business & management, It & computing, Non-profit, philanthropic & civil society, Technology, Trade & international business
Representation
Minority-Owned BIPOC-Owned 2slgbtqia+-owned Small Business Sustainable/green
+ 2

We're solving the existential pain point of the social impact sectors - funding scarcity - with data creation, management, and analysis services.