Argilla is an open-source data curation platform for large language models (LLMs).
Argilla is an open-source data curation platform for large language models (LLMs). Argilla aims to help users build language models faster through data curation with both human and machine feedback, providing support for steps in the MLOps cycle, including data labeling and model monitoring. Argilla is compatible with major NLP libraries, including Hugging Face transformers, spaCy, Stanford Stanza, and Flair. Users can combine their preferred libraries without having to implement a specific interface. Argilla allows users to combine hand-labeling with active learning, bulk-labeling, zero-shot models, and weak supervision in novel data annotation workflows.
Argilla is a Spanish company founded in 2017 by Daniel Vila Suero (CEO) and Francisco Aranda. The company was originally called Recognai, and its first product, launched in 2021, was named Rubrix. In October 2022, Recognai renamed both the company and its platform Argilla. The name is the Latin word for "clay" and evokes modeling clay and the shaping of models. Argilla has locations in Madrid and Carpesa, Valencia.
On January 25, 2023, Argilla announced $1.6 million in funding from Zetta Venture Partners (whose portfolio includes Kaggle, Domino Data Lab, and Weaviate) and Caixa Capital Risc (whose portfolio includes Vilynx and Codee).
Argilla is built on five core components:
January 25, 2023
October 25, 2022
2021
2017
We are building the open-source platform for data-centric NLP.