Log in
Enquire now
Iterative.ai

Iterative.ai

Iterative.ai is a machine learning and artificial intelligence training platform provider.

OverviewStructured DataIssuesContributors

Contents

iterative.ai
Is a
Company
Company
Organization
Organization

Company attributes

Industry
Big data
Big data
0
Artificial Intelligence (AI)
Artificial Intelligence (AI)
0
ModelOps
ModelOps
0
Technology
Technology
Machine learning
Machine learning
0
Location
San Francisco
San Francisco
0
B2X
B2B
B2B
0
CEO
Dmitry Petrov
Dmitry Petrov
0
Founder
Ivan Shcheklein
Ivan Shcheklein
0
Dmitry Petrov
Dmitry Petrov
0
AngelList URL
angel.co/iterativeai
Pitchbook URL
pitchbook.com/profiles...300262-96
Legal classification
Incorporation (business)
Incorporation (business)
Date Incorporated
2018
Number of Employees (Ranges)
11 – 50
Full Address
450 Townsend Street Suite 100 San Francisco, CA 94107 United States
Investors
Afore Capital
Afore Capital
Jude Gomila
Jude Gomila
True Ventures
True Ventures
468 Capital
468 Capital
Founded Date
2018
0
Total Funding Amount (USD)
43,900,000
Latest Funding Round Date
June 3, 2021
Competitors
Abacus.ai
Abacus.ai
0
Wallaroo (company)
Wallaroo (company)
0
Comet
Comet
0
Weights and Biases
Weights and Biases
0
Encord
Encord
0
IBM
IBM
0
Domino Data Lab
Domino Data Lab
0
Tecton
Tecton
0
...
Business Model
Subscription (with freemium option)0
Also Known As
Iterative0
Latest Funding Type
Series A
Series A
Wellfound ID
iterativeai

Other attributes

Company Operating Status
Active0
Latest Funding Round Amount (USD)
20,000,000
Overview

Iterative.ai is a machine learning operations (MLOps) company focused on streamlining the workflow of data scientists. The company builds developer tools for machine learning that are designed to reduce the complexity of managing datasets, ML infrastructure, and ML models lifecycle management. Iterative.ai's products have been developed by over 200 open-source contributors, engaged with by more than 4000 community members, used by over 400 companies, and awarded more than 7000 Github stars.

DVC

Data Version Control (DVC) enables the capturing of versions of specific data and models in Git commit commands while storing them on-premises or in cloud storage. It also provides a mechanism to switch between different data contents. The result is a single traversable history for data, code, and ML models.

DVC enables data versioning through codification, wherein simple metafiles are produced once by the user, describing which datasets, ML artifacts, and other items should be tracked. This metadata can be put in Git in lieu of large files. DVC can then be used to create snapshots of the data, restore previous versions, reproduce experiments, record evolving metrics, and more.

As DVC is used, unique versions of the user's data files and directories are cached in a systematic way to prevent file duplication. Although the working datastore is separated from the workspace to minimize the project's size, it stays connected via file links handled automatically by DVC.

DVC overview video

Features

The DVC platform offers the following features:

  • Compatibility with Git: DVC is compatible with any standard Git server or provider (GitHub, GitLab, etc.) and can be integrated with any Git repository. Data file contents can be shared by network-accessible storage or any supported cloud solution. DVC offers advantages similar to that of a distributed version control system, such as lock-freedom, local branching, and versioning.
  • Support of various kinds of storage systems: DVC can use Amazon S3, Microsoft Azure Blob Storage, Google Drive, Google Cloud Storage, Aliyun OSS, SSH/SFTP, HDFS, HTTP, network-attached storage, or discs to store data. The platform is continuously updated with new remote storage options.
  • Reproducibility: DVC conserves input, configuration, and code data to enable reproducibility of experiments and failure tracking functionality.
  • Git branching: DVC supports instantaneous, non-duplicative Git branching in all file sizes, allowing for the reuse of a single file across multiple experiments.
  • Metric tracking: DVC includes a command to list all branches, along with metric values, to track progress or aid the user in selecting the desired version of code.
  • ML pipelines framework: DVC has a built-in system that assembles ML steps into a DAG (Directed Acyclic Graph) and runs the pipeline end-to-end. DVC handles caching of intermediate results and does not repeat a step if input data or code are the same.
  • Language- and framework-agnostic operation: DVC is independent of the programming language in use, library types, and code structure; reproducibility and pipelines are based on input and output files or directories. Python, R, Julia, Scala Spark, custom binary, Notebooks, flatfiles/TensorFlow, PyTorch, etc. are all supported.
  • HDFS, Hive & Apache Spark: Spark and Hive jobs may be included in the DVC data versioning cycle, along with local ML modeling steps. Spark and Hive jobs can also be managed with DVC. This enables the decrease of feedback loops by decomposing a large cluster task into smaller DVC pipeline steps, as well as independent iteration on the steps with respect to dependencies.
Uses of DVC

DVC can find uses in the storage and processing of data files and in the production of other data or machine learning models. DVC also enables the user to perform the following:

  • Track and save data and machine learning models in the same way that code is captured
  • Create and switch between versions of data and ML models
  • Gain insight into the original structure of datasets and ML artifacts
  • Compare model metrics among experiments
  • Adopt engineering tools and best practices in data science projects
Advantages

Advantages of the DVC tool include those below:

  • No cost: DVC is a free, open-source, command-line tool and does not require databases, servers, or any other special services to operate.
  • Project readability: File names represent variable data and can remain unchanged, ensuring that projects are kept readable.
  • Data management functionality: DVC provides a storage solution for data and models (e.g. SFTP, S3, HDFS, etc.) that is free from Git hosting constraints. The platform also optimizes the storage and transfer of large files.
  • Collaboration capacities: DVC aids collaborative efforts by allowing straightforward distribution of project development, as well as sharing its data internally and remotely or reusing it elsewhere.
  • Data compliance: Data modification attempts can be reviewed as Git receives requests. The user can audit the project's history to learn when and why datasets or models were approved.
  • GitOps: Data science projects can be connected with the Git-powered platform. Git workflows support tools such as continuous integration (e.g. CML or CI/CD), as well as specialized patterns (e.g. data registries), and other best practices.
CML

Continuous Machine Learning (CML) is an open-source library for implementing CI/CD (continuous integration/delivery) in machine learning projects. It can be used to automate parts of the user's development workflow, including model training and evaluation, comparing ML experiments across the user's project history, and monitoring variable datasets.

CML was developed to enable the use of GitLab or GitHub to manage ML experiments, track whoever trains ML models or modifies data and at what time, and to automatically generate reports for ML experiments, with metrics and plots in every Git Pull Request. CML allows the users to build their own ML platform using GitHub or GitLab and cloud services, such as AWS, Azure, or GCP. Like DVC, CML works independently of extraneous databases and services.

Studio

Iterative.ai's Studio is a collaboration tool for machine learning, offering data and model management, experiment tracking, visualization, and automation. Studio is offered for teams and for individual users and works with other Iterative.ai software products.

MLEM

MLEM is an open-source tool offered by Iterative.ai and is intended to help users simplify machine learning model deployments. MLEM allows users to save an ML model with a Python call, can capture the ML models metadata automatically in a human-readable YAML format, allows users to deploy models where they want, and enables them to switch platforms for deployments. MLEM is developed to help users make a Git model registry and was developed for Git-native ML models.

Timeline

No Timeline data yet.

Funding Rounds

Products

Acquisitions

SBIR/STTR Awards

Patents

Further Resources

Title
Author
Link
Type
Date

MLOps Company Iterative Sees Steady Growth in First Half of 2022

https://www.businesswire.com/news/home/20221006005694/en/MLOps-Company-Iterative-Sees-Steady-Growth-in-First-Half-of-2022

Web

October 6, 2022

References

Find more companies like Iterative.ai

Use the Golden Query Tool to find similar companies in the same industry, location, or by any other field in the Knowledge Graph.
Open Query Tool
Access by API
Golden Query Tool
Golden logo

Company

  • Home
  • Press & Media
  • Blog
  • Careers
  • WE'RE HIRING

Products

  • Knowledge Graph
  • Query Tool
  • Data Requests
  • Knowledge Storage
  • API
  • Pricing
  • Enterprise
  • ChatGPT Plugin

Legal

  • Terms of Service
  • Enterprise Terms of Service
  • Privacy Policy

Help

  • Help center
  • API Documentation
  • Contact Us
By using this site, you agree to our Terms of Service.