编辑 | blame | 历史 | 原始文档

---
title: OCR Invoices Pre-NER BIO Format
type: templates
category: Community Contributions
cat: community
order: 1003
meta_title: OCR Invoices Pre-NER BIO Format Data Labeling Template
meta_description: Template for ocr invoices pre-ner bio format with Label Studio
community: true
community_author: redeipirati
community_contributors: carly-bartel
community_repo: awesome-label-studio-config
github_repo: humanSignal/awesome-label-studio-config
report_bug_url: https://github.com/humanSignal/awesome-label-studio-config/issues/new?template=bug_report.yml&config-name=ocr-invoices-pre-ner-bio-format&title=Bug%20in%20ocr-invoices-pre-ner-bio-format&body=I%20found%20a%20bug%20in%20the%20ocr-invoices-pre-ner-bio-format%20template.%0A%0A%23%23%20Steps%20to%20Reproduce%0A1.%20...%0A2.%20...%0A%0A%23%23%20Expected%20Behavior%0A...%0A%0A%23%23%20Actual%20Behavior%0A...%0A%0A%23%23%20Environment%0A-%20Label%20Studio%20Version:...%0A-%20Browser%20(if%20applicable):...%0A-%20Operating%20System:...%0A%0A%23%23%20Additional%20Context%0A...%0A

repo_url: https://github.com/HumanSignal/awesome-label-studio-config/tree/main/label-configs/ocr-invoices-pre-ner-bio-format

OCR text extraction and tokenization with BIO format for invoice documents. All tokens are initially tagged as 'O' (Outside) for subsequent NER tagging.

Labeling Configuration

<View>
  <!-- The image to annotate -->
  <Image name="image" value="$image" zoomControl="true"/>

  <!-- Bounding-box control that will receive the "rectanglelabels" results
       coming from your OCR model (from_name = "label") -->
  <RectangleLabels name="label" toName="image" choice="single">
    <!-- You only emit the generic "O" class, but feel free to add more labels -->
    <Label value="O" background="#FFA500"/>
  </RectangleLabels>

  <!-- Per-region transcription box (from_name = "transcription").
       Because perRegion="true", one TextArea is linked to each rectangle. -->
  <TextArea name="transcription"
            toName="image"
            perRegion="true"
            editable="true"
            rows="1"
            required="true"
            placeholder="Type or correct OCR text…"/>
</View>

About the labeling configuration

All labeling configurations must be wrapped in View tags.

This configuration uses the following tags:

Usage Instructions

This configuration provides a streamlined interface for OCR text verification and correction: