Synthetic data

Algorithmically generated information that imitates real data and can substitute for datasets used for testing and training in artificial intelligence.

Is a: Industry
Wikidata ID: Q7662746

Synthetic data is algorithmically generated information that imitates real data. It can substitute for the datasets used to test and train artificial intelligence (AI) and machine learning models. To generate synthetic data, an algorithm is typically given a smaller set of real-world data and produces new data with similar properties.
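
The generation step described above can be sketched in a few lines of Python. This is a minimal illustration, not a production method: it assumes each column is independent and roughly normally distributed, and the function names (`fit_gaussian`, `sample_synthetic`) and the height/weight rows are hypothetical.

```python
import random
import statistics


def fit_gaussian(rows):
    """Estimate a per-column mean and standard deviation from real rows."""
    columns = list(zip(*rows))
    return [(statistics.mean(col), statistics.stdev(col)) for col in columns]


def sample_synthetic(params, n, seed=0):
    """Draw n synthetic rows, sampling each column independently."""
    rng = random.Random(seed)  # seeded so the output is reproducible
    return [tuple(rng.gauss(mu, sigma) for mu, sigma in params) for _ in range(n)]


# Hypothetical small "real" dataset: (height in cm, weight in kg).
real_rows = [(170.0, 65.0), (182.0, 80.0), (165.0, 58.0), (175.0, 72.0), (178.0, 77.0)]

params = fit_gaussian(real_rows)
synthetic_rows = sample_synthetic(params, n=1000)
```

Because the columns are sampled independently, any correlation between height and weight in the real rows is lost. Generators used in practice model such relationships as well.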

Synthetic data addresses AI problems that stem from insufficient data, either by producing artificial data from scratch or by producing novel and diverse training examples through data manipulation techniques. It can provide a solution when datasets are too small or when the cost of manually labeling data is prohibitively high. Synthetic datasets can also be cheaper to produce than traditional ones.
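
As one illustration of the data manipulation route, the sketch below derives extra labeled examples from existing ones by mirroring an image and by adding small random noise. It assumes grayscale images stored as nested lists of pixel values between 0 and 1, and that neither transformation changes the label, which does not hold for every task (mirrored text, for example). All names and the tiny dataset are hypothetical.

```python
import random


def horizontal_flip(image):
    """Mirror an image left to right."""
    return [row[::-1] for row in image]


def add_noise(image, rng, scale=0.05):
    """Perturb each pixel slightly, clamping the result to [0, 1]."""
    return [
        [min(1.0, max(0.0, px + rng.uniform(-scale, scale))) for px in row]
        for row in image
    ]


def augment(dataset, seed=0):
    """Return the original examples plus two manipulated copies of each."""
    rng = random.Random(seed)
    augmented = []
    for image, label in dataset:
        augmented.append((image, label))                   # original
        augmented.append((horizontal_flip(image), label))  # mirrored copy
        augmented.append((add_noise(image, rng), label))   # noisy copy
    return augmented


# Hypothetical labeled dataset of two 2x3 grayscale images.
dataset = [
    ([[0.0, 0.5, 1.0], [0.2, 0.4, 0.6]], "cat"),
    ([[1.0, 1.0, 0.0], [0.0, 0.3, 0.9]], "dog"),
]
augmented = augment(dataset)
```

Each labeled image yields three training examples, so the manual labeling effort is reused rather than repeated.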

Synthetically generated datasets can be used to train machine learning models, particularly in computer vision. Synthetic data may augment real datasets to cover parts of the data distribution that are underrepresented, which helps alleviate dataset bias. It may also be useful when real data is impossible or prohibitively difficult to acquire because of privacy or legal issues. Waymo, the self-driving company that began as a Google project, has used driving simulations as synthetic training data. Facebook was reported to use synthetic data to train algorithms that detect bullying language.
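
Covering an underrepresented part of a dataset can be sketched as follows. The example balances class counts by interpolating between randomly chosen pairs of points from the smaller class. This is a simplified scheme loosely inspired by SMOTE, not SMOTE itself (which interpolates toward nearest neighbors), and the function names, labels, and points are hypothetical.

```python
import random
from collections import Counter


def interpolate(a, b, t):
    """Return the point a fraction t of the way from a to b."""
    return tuple(x + t * (y - x) for x, y in zip(a, b))


def oversample_minority(points, labels, seed=0):
    """Add synthetic points until every class matches the largest class."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    new_points, new_labels = list(points), list(labels)
    for label, count in counts.items():
        members = [p for p, l in zip(points, labels) if l == label]
        if len(members) < 2:
            continue  # at least two real points are needed to interpolate
        for _ in range(target - count):
            a, b = rng.sample(members, 2)
            new_points.append(interpolate(a, b, rng.random()))
            new_labels.append(label)
    return new_points, new_labels


# Hypothetical imbalanced dataset: six "common" points, two "rare" points.
points = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.3), (0.3, 0.2), (0.0, 0.4), (0.4, 0.1),
          (2.0, 2.0), (2.5, 2.4)]
labels = ["common"] * 6 + ["rare"] * 2

balanced_points, balanced_labels = oversample_minority(points, labels)
```

The synthetic points always lie on line segments between real points of the same class, so this approach fills in the region the rare class already occupies but cannot discover regions the real data never covered.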

Further Resources

Title: Deep learning with synthetic data will democratize the tech industry
Author: Evan Nisselson
Link: https://techcrunch.com/2018/05/11/deep-learning-with-synthetic-data-will-democratize-the-tech-industry/
Type: Web
Date: May 11, 2018

Title: Introducing OpenSynthetics: The First Community Hub Focused on Synthetic Data for AI Development
Author: Synthesis AI
Link: https://www.prnewswire.com/news-releases/introducing-opensynthetics-the-first-community-hub-focused-on-synthetic-data-for-ai-development-301525351.html
Type: Web
Date: April 14, 2022

Title: Synthetic Data for Deep Learning
Author: Sergey I. Nikolenko
Link: https://arxiv.org/pdf/1909.11512.pdf
Date: September 25, 2019
