Skip to content

Latest commit

 

History

History
80 lines (52 loc) · 2.08 KB

File metadata and controls

80 lines (52 loc) · 2.08 KB

Audio Transcript Generator

Table of Contents

About

The Audio Transcript Generator is a Python application that takes an MP3 or WAV audio file as input, processes the audio, and outputs the transcribed text of the audio content. This project leverages the power of speech recognition to convert spoken words into written text.

Demo

Features

  • Transcribe audio files (MP3/WAV) to text.
  • Easy-to-use command-line interface.
  • Supports multiple languages for transcription.
  • Customizable output options.

Getting Started

Prerequisites

  • Python 3.6 or higher
  • FFmpeg (required for processing audio files)

Installation

  1. Clone this repository to your local machine:

    git clone https://github.com/gpapadopp/voice-recognition-python.git
  2. Navigate to the project directory:

    cd voice-recognition-python
  3. Install the required Python packages:

    pip install -r requirements.txt

Usage

  1. Place your MP3 or WAV audio file in the project directory.

  2. Open a the main.py file of the project, go to the last row, and place the file name of your MP3 or WAV audio file.

  3. The transcription will be displayed in the terminal.

Contributing

Contributions are welcome! If you'd like to contribute to this project, please follow these steps:

  1. Fork the repository.
  2. Create a new branch for your feature/bugfix: git checkout -b feature-name.
  3. Make your changes and commit them: git commit -m "Description of changes".
  4. Push your changes to your fork: git push origin feature-name.
  5. Create a pull request detailing your changes.

License

This project is licensed under the MIT License.

Contact

For questions or inquiries, please contact georgepap2001@gmail.com.