Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

main branch may have a streaming issue #215

Open
HelloSzymon opened this issue Oct 3, 2024 · 1 comment
Open

main branch may have a streaming issue #215

HelloSzymon opened this issue Oct 3, 2024 · 1 comment
Labels
needs info Further information is requested

Comments

@HelloSzymon
Copy link

Hello,

I have i think weird problem. Code taken from ContentView of the SDK is not transcribing the voice. The contentView code was tested on iphone simulator/ iphone real device and apple vision pro simulator. On all same results so some random text:

Simulator Screenshot - Apple Vision Pro - 2024-10-04 at 00 06 42

I also tried with basic implementation which is provided in documentation and print(transcription) returns me nil. For now i do not know what and where i might find problem. Package depndencies is set to main branch. Audio file is set correctly.


import SwiftUI
import WhisperKit

struct ContentView: View {
    var body: some View {
        VStack {
            Image(systemName: "globe")
                .imageScale(.large)
                .foregroundStyle(.tint)
            Text("Hello, world!")
        }
        .padding()
        .onAppear{
            Task {
               let pipe = try? await WhisperKit()
                let path = Bundle.main.url(forResource: "MP3sample", withExtension: "mp3")!.absoluteString

               let transcription = try? await pipe!.transcribe(audioPath: path)?.text
                print(transcription)
            }
            
        }
    }
@atiorh
Copy link
Contributor

atiorh commented Oct 5, 2024

@HelloSzymon

  • Please depend on a release tag instead of main if this is for your product.
  • We will create a new release this week with the latest main.
  • For development, please report the issue with your OS version and real device name.
  • I recommend testing "Transcribe > From File" (as opposed to streaming) first so we can narrow down your issue to the mic input source or the inference engine.
  • Finally, the title of the issue is not descriptive.

@atiorh atiorh changed the title WhisperKit is basically not working (Source file / recording) main branch may have a streaming issue Oct 5, 2024
@atiorh atiorh added the needs info Further information is requested label Oct 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
needs info Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants