label-studio.git

编辑 | blame | 历史 | 原始文档

---
title: Optical Character Recognition for Market Research Surveys
type: templates
hide_menu: true
category: Computer Vision
cat: computer-vision
order: 1103

meta_description: Template for using Label Studio to perform optical character recognition (OCR).

Screenshot of labeling interface

In the realm of market research surveys, Optical Character Recognition (OCR) labeled data is essential for enabling AI models to efficiently extract and analyze textual insights from a vast array of unstructured survey responses. These models aim to automate sentiment analysis, trend identification, and customer feedback interpretation, directly affecting data-driven decision-making processes in business strategies.

However, the hurdles in this domain are significant; labeling large volumes of survey data is often time-intensive, leading to potential inconsistencies in labeling accuracy. Additionally, the necessity for domain expertise adds complexity, as annotators must not only accurately identify relevant text but also understand nuanced sentiment and intent. Label Studio effectively addresses these challenges through its hybrid AI + human-in-the-loop approach. The platform employs AI-assisted pre-labeling to accelerate initial data processing, while custom templates and review workflows streamline the annotation process. Moreover, built-in collaboration tools facilitate seamless communication among annotators and experts, ensuring that final labels meet high standards of accuracy and relevance, ultimately enhancing model performance and scalability.

Open in Label Studio

Labeling configuration

<View>
  <Image name="image" value="$ocr"/>
  <Labels name="label" toName="image">
    <Label value="Customer Feedback" background="green"/>
    <Label value="Product Feature Mention" background="blue"/>
    <Label value="Competitor Reference" background="orange"/>
    <Label value="Pricing Comment" background="purple"/>
    <Label value="Service Satisfaction" background="red"/>
  </Labels>
  <Rectangle name="bbox" toName="image" strokeWidth="3"/>
  <Polygon name="poly" toName="image" strokeWidth="3"/>
  <TextArea name="transcription" toName="image"
            editable="true"
            perRegion="true"
            required="true"
            maxSubmissions="1"
            rows="5"
            placeholder="Transcribed Feedback Text"
            displayMode="region-list"
            />
</View>

This configuration performs text extraction tasks on survey response images using multiple shapes. Use this template to select areas with shapes and transcribe the corresponding survey feedback text from each region.

All market research survey text extraction configurations must be enclosed in View tags.

Use the Image object tag to specify the survey response image to annotate:

<Image name="image" value="$ocr"/>

Use the Labels tag to define which annotation categories are available for tagging the different regions in the survey images:

<Labels name="label" toName="image">
  <Label value="Customer Feedback" background="green"/>
  <Label value="Product Feature Mention" background="blue"/>
  <Label value="Competitor Reference" background="orange"/>
  <Label value="Pricing Comment" background="purple"/>
  <Label value="Service Satisfaction" background="red"/>
</Labels>

You can modify each Label's value to assign specific survey-related text category labels like "Customer Feedback" or "Pricing Comment" to segments of the image.

Use the Rectangle tag to add unlabeled rectangular regions for annotation:

<Rectangle name="bbox" toName="image" strokeWidth="3"/>

Using the Rectangle tag (instead of RectangleLabels) allows annotators to first draw regions highlighting text areas, assign labels afterward, and then transcribe the text for each selected region. This workflow fits typical market research survey text transcription and facilitates pre-annotation.

Use the Polygon tag to add unlabeled polygonal regions:

<Polygon name="poly" toName="image" strokeWidth="3"/>

The strokeWidth attribute controls the thickness of the polygon outline.

Use the TextArea tag to add transcription input fields for each labeled region on the image, whether rectangular or polygonal:

<TextArea name="transcription" toName="image"
          editable="true"
          perRegion="true"
          required="true"
          maxSubmissions="1"
          rows="5"
          placeholder="Transcribed Feedback Text"
          displayMode="region-list"
          />

The editable="true" attribute lets annotators revise the transcribed text after submitting it. The displayMode="region-list" shows text areas linked with each region for easier updates. Setting perRegion="true" means each text box corresponds to a specific annotated area, and required="true" enforces text entry before submission. The placeholder sets example prompt text shown in the textbox before entry.