Audio Speech to Text with openAI whisper model Icelandic to English

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • LazyPooky
    ADMINISTRATOR
    Level 35 - Rockin' Poster
    • Oct 2007
    • 7042

    Audio Speech to Text with openAI whisper model Icelandic to English

    The results with AI are exceptional, very accurate and super fast. This topic will discuss the possibilities of AI for the LazyTown Community.

    Speech to text

    First I would like to mention the transcription of speech to text. I have quite a few videos/audio with the Icelandic language and a lot of them I don't know what is being said. I had an idea about some of them, but it turned out not to be quite right.​ I use OpenAI with Whisper models. Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. There is now a large-v2 model which does accurate and incredible fast work (on GPU-NVIDIA). The output is simple TXT, and it also creates automatically subtitle files: SRT, VTT .

    Click image for larger version  Name:	whisper-model-transcriptions.jpg Views:	0 Size:	252.0 KB ID:	194401

    More here:
    https://github.com/openai/whisper
    https://huggingface.co/openai/whisper-large
    Magnús: - I have fans of all ages and I don't think it's weird when older people like LazyTown. LazyTown appeals to people for many different reasons: dancing, acrobatics, etc.
  • LazyPooky
    ADMINISTRATOR
    Level 35 - Rockin' Poster
    • Oct 2007
    • 7042

    #2
    Here are quite a number of whisper models you can choose from, and not only speech-to-text.
    https://huggingface.co/models?sort=t...search=whisper

    Hugginface is one of the largest AI communities that shares new models and datasets.
    https://huggingface.co
    Magnús: - I have fans of all ages and I don't think it's weird when older people like LazyTown. LazyTown appeals to people for many different reasons: dancing, acrobatics, etc.

    Comment

    • boredjedi
      Master
      SPECIAL MEMBER
      MODERATOR
      Level 35 - Rockin' Poster
      • Jun 2007
      • 6971

      #3
      I'm going to start delving into AI. I been looking into the AI sites.
      To the chagrin of Chuft no doubt. I know he's not partial to the AI.
      But there's not stopping it now unfortunately. It's either adapt or
      be left behind.
      http://eighteenlightyearsago.ytmnd.com/

      Comment

      • LazyPooky
        ADMINISTRATOR
        Level 35 - Rockin' Poster
        • Oct 2007
        • 7042

        #4
        Originally posted by boredjedi
        I'm going to start delving into AI. I been looking into the AI sites.
        To the chagrin of Chuft no doubt. I know he's not partial to the AI.
        But there's not stopping it now unfortunately. It's either adapt or
        be left behind.
        That's great. I try to let AI work locally on Win10, and find software mostly on Github. You don't have to be a nerd or have linux installed. Just read the documentation and follow it. What can be seen online can work local.
        Magnús: - I have fans of all ages and I don't think it's weird when older people like LazyTown. LazyTown appeals to people for many different reasons: dancing, acrobatics, etc.

        Comment

        Working...