— iIT-Services

Archive
Tag "Transcription"

noScribe is an AI-based software designed for transcribing audio, particularly useful for qualitative social research and journalistic interviews. The software is free, open-source (licensed under GPL-3.0), and operates entirely offline, meaning no data is sent to the cloud, ensuring privacy. It can recognize up to 99 languages and distinguish between different speakers, which is particularly helpful for interviews with multiple participants.

The software includes an editor that allows users to review, verify, and correct the transcriptions manually. It utilizes advanced AI models, such as OpenAI’s Whisper, faster-whisper by Guillaume Klein, and pyannote by Hervé Bredin, for the transcription process.

It requires a relatively up-to-date computer to function efficiently, slower systems may result in long transcription times. The software is around 3.7 GB, poor audio quality may lead to transcription errors.

noScribe aims to reduce the difficulty of transcription for researchers and journalists, offering a reliable, private, and easy-to-use tool for processing interviews.

Find a review of noScribe (in German) here: https://sozmethode.hypotheses.org/2315.

Source: https://github.com/kaixxx/noScribe

Read More

In collaboration with the Public Prosecutor’s Office of the Canton of Zurich, the Zurich Cantonal Police, the Zurich City Police and the Winterthur City Police, a team from the Statistical Office of the Canton of Zurich has developed a prototype app that automatically transcribes confidential audio and video files. Also in Swiss German.

The app is open-source and based on the Whisper v3 Large model, which enables transcriptions up to 15 times faster than in real time – without license or usage costs. The app offers a wide range of functions such as automatic speaker recognition, multi-file upload, predefined vocabulary and various export options. Transcripts can be edited directly in the application and linked synchronously with the source file.

Hardware requirements: recommend using a CUDA-compatible graphics card with at least 8GB VRAM, as transcription on a CPU is extremely slow.

Source: https://github.com/machinelearningZH/audio-transcription.

Read More

Automatische Transkription, mit Schweizer Mundart-Erkennung und Branchenfokus: recapp.ch.

Der Dienst töggl ist ein KI-getriebenes Spracherkennungsprogramm mit Fokus auf Schweizer Sprachen, insbesondere Schweizerdeutsch. Satzzeichen werden von selbst gesetzt. Segmentierung nach Sprechern erfolgt ebenfalls automatisch. Nach dem Transkribieren kann der Text manuell überarbeitet und anschliessend exportiert werden: xn--tggl-5qa.ch, töggl.ch.

Review by Digitec “Schwiizertüütsch transkribieren – Töggl im Test”: digitec.ch/de/page/schwiizertueuetsch-transkribieren-toeggl-im-test-22436.

Trint’s AI turns audio & video files to text in 30+ languages. Tell stories faster by transcribing, translating, editing and collaborating in a single workflow: trint.com.

Weitere Reviews: blog.clickomania.ch/2023/02/03/transkriptions-tools-im-vergleich/#4-3, blog.clickomania.ch/2021/12/08/toeggl-ch/.

Weiteres Tool: iit-services.ch/stt4sg.

Read More

Research project on a sentence-level transcription engine from Swiss German audio to Standard German text, by FHNW & ZHAW: https://stt4sg.fhnw.ch.

Read More

Automated transcripts and translations in 40+ languages: https://sonix.ai/.

Read More

Capture, edit, and share audio and video. Descript is an all-in-one audio and video editor that makes editing as easy as a doc. Upload media or record directly in Descript to instantly transcribe your file into text, then tweak the text to directly edit your media clips. Edit out filler words and silent gaps with a single click. Record your screen and webcam for presentations and video messages and edit out mistakes before publishing. Export your project to other pro apps: descript.com.

Read More

fre:ac is an audio converter and CD ripper with support for various popular formats and encoders. It currently converts between MP3, MP4/M4A, WMA, Ogg VorbisFLAC, AAC, WAV and Bonk formats.

With fre:ac you easily rip your audio CDs to MP3 or WMA files for use with your hardware player or convert files that do not play with other audio software. You can even convert whole music libraries retaining the folder and filename structure.

The integrated CD ripper supports the CDDB/freedb online CD database. It will automatically query song information and write it to ID3v2 or other title information tags.

Features

Link: https://www.freac.org.

Read More