8000 Transcribe: New language models by viren-nadkarni · Pull Request #1700 · localstack/docs · GitHub
[go: up one dir, main page]

Skip to content

Transcribe: New language models #1700

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
Mar 21, 2025
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
63 changes: 36 additions & 27 deletions content/en/user-guide/aws/transcribe/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -105,32 +105,7 @@ $ jq .results.transcripts[0].transcript 7844aaa5.json
</disable-copy>
{{< / command >}}

## Resource Browser

The LocalStack Web Application provides a Resource Browser for managing Transcribe Transcription Jobs.
You can access the Resource Browser by opening the LocalStack Web Application in your browser, navigating to the **Resource Browser** section, and then clicking on **Transcribe Service** under the **Machine Learning** section.

<img src="transcribe-resource-browser.png" alt="Transcribe Resource Browser" title="Transcribe Resource Browser" width="900" />
<br>
<br>

The Resource Browser allows you to perform the following actions:

- **Create Transcription Job**: Create a new transcription job by clicking on the **Create Transcription Job** button, and then providing the required details.
- **View Transcription Job**: View the details of a specific transcription job by clicking on the job in the list.
- **Delete Transcription Job**: Delete the transcription job by clicking on the **Actions** button followed by **Remove Selected** button.

## Examples

The following code snippets and sample applications provide practical examples of how to use Transcribe in LocalStack for various use cases:

- [Serverless Transcription App using Transcribe, S3, Lambda, SQS, SES](https://github.com/localstack-samples/sample-serverless-transcribe)

## Current Limitations

Currently, our Transcribe emulation offers only supported formats and languages.

### Supported Formats
## Audio Formats

The following input media formats are supported:

Expand All @@ -142,25 +117,59 @@ The following input media formats are supported:
- Matroska Video files (MKV)
- Waveform Audio File Format (WAV)

### Supported Languages
## Supported Languages

The following languages and dialects are supported:

| Language | Language Code |
| ---------------- | ------------- |
| Catalan | `ca_ES` |
| Czech | `cs_CZ` |
| German | `de-DE` |
| English, British | `en-GB` |
| English, Indian | `en-IN` |
| English, US | `en-US` |
| Spanish | `es-ES` |
| Farsi | `fa-IR` |
| French | `fr-FR` |
| Gujarati | `gu_IN` |
| Hindi | `hi-IN` |
| Italian | `it-IT` |
| Japan | `ja-JP` |
| Kazakh | `kk-KZ` |
| Korean | `ko-KR` |
| Dutch | `nl-NL` |
| Polish | `pl-PL` |
| Portuguese | `pt-BR` |
| Russian | `ru-RU` |
| Telugu | `te-IN` |
| Turkish | `tr-TR` |
| Ukrainian | `uk-UA` |
| Uzbek | `uz-UZ` |
| Vietnamese | `vi-VN` |
| Chinese | `zh-CN` |

## Resource Browser

The LocalStack Web Application provides a Resource Browser for managing Transcribe Transcription Jobs.
You can access the Resource Browser by opening the LocalStack Web Application in your browser, navigating to the **Resource Browser** section, and then clicking on **Transcribe Service** under the **Machine Learning** section.

<img src="transcribe-resource-browser.png" alt="Transcribe Resource Browser" title="Transcribe Resource Browser" width="900" />
<br>
<br>

The Resource Browser allows you to perform the following actions:

- **Create Transcription Job**: Create a new transcription job by clicking on the **Create Transcription Job** button, and then providing the required details.
- **View Transcription Job**: View the details of a specific transcription job by clicking on the job in the list.
- **Delete Transcription Job**: Delete the transcription job by clicking on the **Actions** button followed by **Remove Selected** button.

## Examples

The following code snippets and sample applications provide practical examples of how to use Transcribe in LocalStack for various use cases:

- [Serverless Transcription App using Transcribe, S3, Lambda, SQS, SES](https://github.com/localstack-samples/sample-serverless-transcribe)

## Limitations

Transcribe does not support speaker diarization and does not produce certain other numerical information, like confidence levels.
0