@TimPietruskyRunPod
Member

Summary

Add transcription examples using the new pruna/whisper-v3-large model.

Note: This is a draft PR. The @runpod/ai-sdk-provider package must first be released with transcription support.

New Examples

  • transcribe-audio-url.js - Transcribe audio from a URL
  • transcribe-audio-file.js - Transcribe audio from a local file (base64-encoded)
  • generate-transcription-demo-audio.js - Generate demo audio for testing
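
For reviewers unfamiliar with the base64 step mentioned for transcribe-audio-file.js, here is a minimal sketch of how a local audio file could be encoded in Node.js. This is an illustration, not the code in this PR: the helper names `audioFileToBase64` and `base64ToBytes` are hypothetical, and it uses CommonJS `require` so it runs standalone.

```javascript
// Sketch only: these helpers are hypothetical and not part of this PR.
const fs = require('node:fs');

// Read an audio file from disk and return its contents as a base64 string.
function audioFileToBase64(filePath) {
  const bytes = fs.readFileSync(filePath); // Buffer holding the raw audio bytes
  return bytes.toString('base64');
}

// Decode a base64 string back into raw bytes, e.g. to verify a round trip.
function base64ToBytes(encoded) {
  return Buffer.from(encoded, 'base64');
}

module.exports = { audioFileToBase64, base64ToBytes };
```

The resulting string could then be passed as the audio input to the transcription call, assuming the released provider accepts base64 data; that depends on the pending @runpod/ai-sdk-provider release.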

Also Fixed

  • generate-speech-chatterbox-turbo.js - Fixed import to use experimental_generateSpeech
  • .gitignore - Added *.wav to ignore generated audio files

Demo Audio

Uploaded to R2: https://image.runpod.ai/demo/transcription-demo.wav

Test plan

  • Wait for @runpod/ai-sdk-provider release with transcription support
  • Update package.json to use released version
  • Run node transcribe-audio-url.js
  • Run node transcribe-audio-file.js

- Add transcribe-audio-url.js - transcription from URL
- Add transcribe-audio-file.js - transcription from local file
- Add generate-transcription-demo-audio.js - generate demo audio
- Fix generate-speech-chatterbox-turbo.js import (experimental_generateSpeech)
- Add *.wav to gitignore