Label Studio overview

What is Label Studio?

Label Studio is an open source data labeling tool that supports multiple projects, users, and data types in one platform. It allows you to do the following:

  • Perform different types of labeling with many data formats.

  • Integrate Label Studio with machine learning models to supply predictions for labels (pre-labels), or perform continuous active learning. See Set up machine learning with your labeling process.

Interface


Project List Screenshot


Data Manager Screenshot


Quick View Screenshot

Labeling workflow

Start and finish a labeling project with Label Studio by following these steps:

  1. Create accounts for Label Studio. Create an account to manage and set up labeling projects.
  2. Restrict access to the project. Set up role-based access control. Only available in Label Studio Enterprise Edition.
  3. Set up the labeling project. Define the type of labeling to perform on the dataset and configure project settings.
  4. Set up the labeling interface. Add the labels that you want annotators to apply and customize the labeling interface.
  5. Import data as labeling tasks.
  6. Label and annotate the data.
  7. Review the annotated tasks. Only available in Label Studio Enterprise Edition.
  8. Export the labeled data or the annotations.

Architecture

You can use any of the Label Studio components in your own tools, or customize them to suit your needs. Before customizing Label Studio extensively, you might want to review Label Studio Enterprise Edition to see if it already contains the relevant functionality you want to build. See Label Studio Features for more.

The component parts of Label Studio are available as modular extensible packages that you can integrate into your existing machine learning processes and tools.

Module Technology Description
Label Studio Backend Python and Django Use to perform data labeling.
Label Studio Frontend JavaScript web app using React and MST Perform data labeling in a user interface.
Data Manager JavaScript web app using React Manage data and tasks for labeling.
Machine Learning Backends Python Predict data labels at various parts of the labeling process.