--- title: Automatic Speech Recognition with NVidia NeMo type: guide tier: all order: 60 hide_menu: true hide_frontmatter_title: true meta_title: Automatic Speech Recognition with NeMo meta_description: Tutorial on how to use set up Nvidia NeMo to use for ASR tasks in Label Studio categories: - Audio/Speech Processing - Automatic Speech Recognition - NeMo - NVidia image: "/guide/ml_tutorials/nvidia.png" --- # ASR with NeMo This example demonstrates how to use the [NeMo](https://github.com/NVIDIA/NeMo/blob/main/nemo/collections/asr/README.md) to perform ASR (Automatic Speech Recognition) in Label Studio. Use this model if you want to transcribe and fix your audio data. ## Before you begin Before you begin, you must install the [Label Studio ML backend](https://github.com/HumanSignal/label-studio-ml-backend?tab=readme-ov-file#quickstart). This tutorial uses the [`nemo_asr` example](https://github.com/HumanSignal/label-studio-ml-backend/tree/master/label_studio_ml/examples/nemo_asr). ## Labeling interface This example works with the Label Studio's pre-built **Audio Transcription** template (available under **Audio Processing > Audio Transcription**). ```xml