The Segment Anything Model (SAM) is a promptable segmentation system from Meta AI with zero-shot generalization to unfamiliar objects and images, without the need for additional training.
Released on April 5, 2023, the Segment Anything project was developed by Meta AI. The company has made the model available under a permissive open license (Apache 2.0) and released the accompanying dataset for research purposes. Segmentation is the process of identifying the image pixels that belong to an object. Meta already uses similar technology internally for tasks such as tagging photos, moderating prohibited content, and determining which posts are recommended to users on Facebook and Instagram.
SAM can identify objects in images from a variety of input prompts, allowing a wide range of segmentation tasks without additional training. Supported prompts include foreground/background points, bounding boxes, and masks; text prompts are explored in the accompanying paper, but the capability is not supported in the released model. SAM's promptable design enables the model to be integrated with other systems.
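As a concrete illustration, here is a minimal sketch of point and box prompting with the official segment-anything Python package; the checkpoint name matches the released ViT-H weights, while the image path and coordinates are placeholders.

```python
import numpy as np
import cv2
from segment_anything import sam_model_registry, SamPredictor

# Load the released ViT-H checkpoint (placeholder path).
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

# Compute the image embedding once; all prompts below reuse it.
image = cv2.cvtColor(cv2.imread("example.jpg"), cv2.COLOR_BGR2RGB)
predictor.set_image(image)

# Prompt with a single foreground point (label 1 = foreground, 0 = background).
masks, scores, _ = predictor.predict(
    point_coords=np.array([[500, 375]]),
    point_labels=np.array([1]),
    multimask_output=True,  # return several candidate masks with quality scores
)

# Prompt with a bounding box instead, given as (x0, y0, x1, y1).
masks, scores, _ = predictor.predict(
    box=np.array([100, 100, 400, 400]),
    multimask_output=False,
)
```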
In the blog post accompanying the release of SAM, Meta discussed some potential future use cases for the model across various industries.
Previously, there were two primary approaches to segmentation. The first, interactive segmentation, required a user to iteratively refine a mask. The second, automatic segmentation, allowed specific object categories to be defined ahead of time but required training on a substantial number of manually annotated objects. SAM generalizes these two classes in a single model: it can perform both interactive and automatic segmentation in a flexible way, thanks to its promptable interface. SAM is also trained on a diverse dataset of over 1 billion masks, enabling it to generalize to new types of objects and images.
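For the fully automatic mode, the released package exposes a mask generator that prompts the model with a grid of points and filters the results. A minimal sketch, with placeholder file paths:

```python
import cv2
from segment_anything import sam_model_registry, SamAutomaticMaskGenerator

sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
mask_generator = SamAutomaticMaskGenerator(sam)

image = cv2.cvtColor(cv2.imread("example.jpg"), cv2.COLOR_BGR2RGB)

# Each result is a dict with a binary mask plus metadata such as area and bbox.
masks = mask_generator.generate(image)
print(f"{len(masks)} masks found; largest area: {max(m['area'] for m in masks)}")
```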
SAM is structured with a ViT-H image encoder that runs once per image and outputs an image embedding. The prompt encoder embeds input prompts such as clicks or boxes. A lightweight transformer-based mask decoder predicts object masks from the image embedding and the prompt embeddings.
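This split is visible in the released code. Below is a simplified sketch of how the three components compose, skipping the image pre- and post-processing that SamPredictor normally handles; the tensor shapes and call signatures reflect the repository at release and should be treated as assumptions.

```python
import torch
from segment_anything import sam_model_registry

sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")

with torch.no_grad():
    # 1) Heavy step, once per image: ViT-H encoder -> image embedding.
    #    The encoder expects a preprocessed 1024x1024 input; a random tensor stands in here.
    image = torch.randn(1, 3, 1024, 1024)
    image_embedding = sam.image_encoder(image)  # roughly (1, 256, 64, 64)

    # 2) Cheap step, per prompt: embed a single foreground point.
    point = torch.tensor([[[512.0, 512.0]]])  # (batch, num_points, xy)
    label = torch.tensor([[1]])               # 1 = foreground
    sparse, dense = sam.prompt_encoder(points=(point, label), boxes=None, masks=None)

    # 3) Lightweight decoder combines the two to predict masks.
    low_res_masks, iou_predictions = sam.mask_decoder(
        image_embeddings=image_embedding,
        image_pe=sam.prompt_encoder.get_dense_pe(),
        sparse_prompt_embeddings=sparse,
        dense_prompt_embeddings=dense,
        multimask_output=True,
    )
```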
The image encoder has 632M parameters, and the prompt encoder and mask decoder together have 4M parameters. The image encoder is implemented in PyTorch and requires a GPU for efficient inference. The prompt encoder and mask decoder can run directly in PyTorch or be converted to ONNX, and they run efficiently on either a CPU or a GPU.
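As a sketch of the ONNX path: assuming the prompt encoder and mask decoder have been exported with the repository's scripts/export_onnx_model.py, they can be run on CPU with ONNX Runtime. The input names below follow the export script at release; treat them as assumptions and confirm with session.get_inputs().

```python
import numpy as np
import onnxruntime

# Placeholder file name for a decoder exported via scripts/export_onnx_model.py.
session = onnxruntime.InferenceSession("sam_decoder.onnx", providers=["CPUExecutionProvider"])

inputs = {
    # Precomputed by the PyTorch image encoder (run once per image).
    "image_embeddings": np.random.randn(1, 256, 64, 64).astype(np.float32),
    # One foreground point plus a padding point with label -1.
    "point_coords": np.array([[[512.0, 512.0], [0.0, 0.0]]], dtype=np.float32),
    "point_labels": np.array([[1, -1]], dtype=np.float32),
    # No prior mask provided.
    "mask_input": np.zeros((1, 1, 256, 256), dtype=np.float32),
    "has_mask_input": np.zeros(1, dtype=np.float32),
    "orig_im_size": np.array([1024, 1024], dtype=np.float32),
}
masks, iou_predictions, low_res_masks = session.run(None, inputs)
```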