Skip to content

theelderemo/ai-audio-tools

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI Audio Tools

Awesome Stars Last Commit PRs Welcome Tools

Typing SVG

I want to?

I want to... Go here
Make a full song from scratch Creation & Production
Clone or transform a voice AI Voice & Cover Generation
Remove vocals from a track Source Separation
Make a podcast Radio & Podcast
Use audio for health/medicine Health & Wellbeing
Build an audio AI app Development

Quick Navigation

Creation & Production Lyric Writing Voice Covers Separation
Mastering Plugins Analysis Health
Podcast Hearing Detection Speech
Transcription TTS Enhancement Development

Badge Key

free freemium paid api open-source enterprise vst hardware acquired


Creation & Production

  • VRS/A freemium - AI-powered lyric writing and music production workstation with multi-model Ghostwriter, Suno integration via browser extension, audio analysis, album art generation, and VRSA Studio (studio.vrsa.app) for a dedicated production environment.
  • Suno freemium - Generative AI music creation platform that allows users to create full songs, including vocals and instrumentation, from text prompts.
  • Soundry AI freemium - AI for Musicians, by Musicians.
  • Sonauto freemium - Create hit songs with AI.
  • Microphone Studio freemium - Multi-track recording without expensive studio equipment.
  • TuneFlow free open-source - Generate lyrics, melody, drum beats and more, while editing and mixing like any professional DAW.
  • CassetteAI freemium - AI powered music production platform: make lyrics, beats & vocals with AI then mix & publish straight from Cassette.
  • AIVA freemium - The Artificial Intelligence composing emotional soundtrack music.
  • beatoven.ai freemium - A simplified music creation tool that helps you create music for your videos and podcasts.
  • Infinite Album freemium - Adaptive AI music for gamers who livestream.
  • Epidemic Sound paid - High quality music and sound effects for all your content, all rights included.
  • Wonder paid - Dynascore: The world's first Dynamic Music Engine.
  • Amper acquired (Acquired by Shutterstock) - AI Music Composition Tools for Content Creators.
  • AudioStack paid api - AI-first platform for producing audio at scale.
  • mayk.it freemium - Your virtual music studio.
  • boomy freemium - Make instant music, share it with the world.
  • enote paid - Intelligent Sheet Music.
  • Qosmo - Qosmo is a group of artists, researchers, designers, and programmers.
  • AI Music acquired (Acquired by Apple) - Our music helps brands enable deeper connections with their audiences.
  • Splash HQ - The next generation of music producers.
  • musico - AI-driven software engine that generates music. It can react to gesture, movement, code or other sound.
  • Yousician freemium - The largest music educator on the planet.
  • Tape It free - App for songwriting & audio recording.
  • sessionwire paid - All-in-one online collaboration platform that delivers a seamless studio experience.
  • Aflorithmic paid api - Professional audio, voice, sound and music to scale.
  • Audio Design Desk paid - The Audio Solution for Video Editors.
  • Never Before Heard Sounds freemium - A music studio powered by AI.
  • NeuralDSP paid vst - Empowers music players by democratizing the access to world-class sound, through an intuitive software/hardware ecosystem.
  • Neutone free vst - AI audio plugin & community bridging the gap between AI research and creativity.
  • Udio freemium - AI music generator with full song creation from text prompts, integrated lyric editor, and granular line-by-line vocal control.
  • Mureka freemium - AI music generation with style-reference input, vocal timbre selection, and voice cloning for demos and song prototyping.
  • Soundverse freemium - Full-suite AI music studio with text-to-song, beat generation, stem separation, and SAAR — a voice-controlled music production assistant.
  • ACE Studio freemium - All-in-one AI music studio with expressive AI vocals, natural-sounding AI instruments, and a DAW bridge for Logic, Ableton, and FL Studio.
  • Stable Audio freemium - Text-to-audio and audio-to-audio generation for music and sound effects from Stability AI, trained on licensed datasets.
  • Riffusion free open-source - Diffusion model-based real-time music generation from text prompts, operating directly on audio spectrograms.
  • LoudMe freemium - Text-to-music generator for royalty-free songs and instrumentals with style and mood controls.
  • Ecrett Music freemium - Scene and mood-based AI background music generator aimed at video and content creators requiring instant scoring.
  • Soundful freemium - AI platform for generating royalty-free, high-quality soundtracks customizable by mood, tempo, and brand identity for commercial use.
  • SongGPT freemium - AI song generator for producing full tracks from short text prompts with genre selection.
  • Tunee freemium - AI music and lyric generation platform with access to multiple underlying generative models for varied output styles.
  • LOVO freemium api - Advanced text-to-speech and voice cloning platform for content creators, supporting emotional range control and voice actor-style production.

↑ Back to top


Lyric Writing & Songwriting

  • VRS/A freemium - AI-powered lyric writing and music production workstation with multi-model Ghostwriter, Suno integration via browser extension, audio analysis, album art generation, and VRSA Studio.
  • Lyric Studio freemium - Mobile-first AI songwriting ecosystem with a lyric editor, AI-generated verse/chorus drafts, rhyme suggestions, and song organization tools.

↑ Back to top

AI Voice & Cover Generation

  • Jammable freemium - (formerly Voicify AI) AI song cover generator with 22,000+ community-uploaded voice models and custom voice cloning from 10 minutes of audio.
  • Musicfy freemium - AI voice covers and voice cloning platform with text-to-music generation, voice-to-instrument conversion, and a large copyright-free vocal library.
  • Lalals freemium - AI voice swapping tool suite with 1,000+ voice options, stem splitting, and real-time conversion for remixes and vocal experimentation.

↑ Back to top

Source Separation

  • Music AI paid api - Professional AI stem separation and audio analysis platform for broadcasters and remixers, partnered with SourceAudio's 140+ broadcaster network.
  • TuneFlow free - A free DAW offering high quality vocal, drums, melody, bass stem separation, all-in-one audio separation, editing and vocal/instrument to MIDI transcription.
  • Spliter.ai freemium - AI Audio Processing.
  • Gaudio enterprise api - Redefine your audio experience in music/video streaming and virtual/augmented reality.
  • AudioShake paid api - An On-Demand Stem Creation Platform for the Music Industry.
  • Audionamix enterprise - Audio separation solutions for the entertainment industry.
  • vocali.se freemium - Separate vocals and music from any song, in seconds.
  • lalal.ai freemium - High-quality stem splitting based on the world's #1 AI-powered technology.
  • VocalRemover free - Separate voice from music out of a song free with powerful AI algorithms.
  • PhonicMind freemium - Separate vocals, drums, bass and other instruments out of your songs with HiFi AI.
  • EasySplitter freemium - AI-Based Vocal Remover Online for DJ Singers.
  • Remover.studio free - Vocal Remover & Online Karaoke.
  • MVSep free - Free separation of songs with many different algorithms (Demucs, MDX, UVR etc).
  • MuzLab freemium - Remove vocals from songs and split drums, bass and other instruments out of music.
  • Fadr freemium - Remove stems, convert to MIDI, and create high-quality remixes and mashups using AI tools.

↑ Back to top

Mastering, Mixing & Production Analysis

  • SoundBoost AI freemium - AI music mastering platform with goal-based controls — specify targets like loudness, warmth, or punch and the engine applies processing automatically.
  • VerifAI Audio freemium - Instant AI-driven feedback on track quality covering mixdown balance, loudness levels, bitrate, and other release-readiness metrics.

↑ Back to top

Plugins & Sample Tools

  • Samplab paid vst - AI VST plugin for granular audio sample editing, enabling note-level pitch manipulation of polyphonic audio with automatic chord progression detection.
  • Slooply freemium - AI-powered sample discovery platform with similarity search, mood/key/BPM filtering, MIDI export, and direct drag-and-drop DAW integration.
  • Atlas paid - AI sample library organizer with auto-tagging, similar-sound search, and a smart drum map interface for large sample collections.
  • Playbeat paid vst - AI generative groove sequencer for instant beat creation with MIDI export and real-time DAW sync.

↑ Back to top

Analysis & Recommendation

  • SONOTELLER freemium - AI music analysis tool for song lyric summarization, theme extraction, and musical feature identification.
  • Musicful freemium - AI-powered music recommendation and discovery engine focused on contextual and emotional matching.
  • Harmix api - AI music search with natural language, videos, similar audio and lyrics. Auto-tagging for audio and video.
  • AIMS paid api - AI-powered music similarity search & auto-tagging for anyone who makes music discovery their business.
  • FeedForward enterprise api - The intuitive audio search engine for audio & sound catalogues.
  • Aimi free - Discover the artists who freed their music from the shackles of songs and playlists.
  • Utopia Music enterprise - Fair Pay for Every Play.
  • Musiio acquired (Acquired by SoundCloud) - Use Artificial Intelligence to help automate your workflows.
  • niland acquired (Acquired by Spotify) - Build AI Powered Music Apps.
  • cyanite freemium api - AI for Music tagging and similarity search.
  • musicube acquired (Acquired by SongTradr) - B2B AI music metadata services like auto-tagging, metadata enrichment and semantic search.
  • Musixmatch freemium api - Algorithms and tools for music discovery, recommendation, and search based on lyrics.
  • hoopr paid - Find the best music, tell better stories, grow your audience.
  • Pex enterprise api - Music identification and copyright compliance. Audio fingerprinting, cover song identification in large scale.

↑ Back to top

Health & Wellbeing

  • Endel freemium - Personalized soundscapes to help you focus, relax, and sleep.
  • Lucid - Transforming music into medicine, using AI to compose and curate a personalized therapeutic music experience.
  • Wavepaths paid - Music for Psychedelic Therapy.
  • Suki enterprise - AI-powered voice solutions for healthcare.
  • audEERING enterprise api - Technology that can detect emotions and health information from the voice.
  • brain.fm freemium - Music to Focus Better.
  • SPOKE freemium - Lo-fi & Lyricism-led Mindfulness music episodes.
  • sona - Music as medicine. Research-based music for anxiety made by Grammy-winning producers.
  • Novoic enterprise - Using speech to detect neurological diseases.
  • Ubenwa enterprise - Infant health analysis based on cry signals.

↑ Back to top

Radio / Podcast

  • faidr free - Your favorite radio, interruption free.
  • fathom - The search engine for podcasts.
  • Nomono paid hardware - A self-contained recording kit for capturing interviews in the field.
  • Descript freemium - All-in-one audio & video editing, as easy as a doc.
  • auphonic freemium - Automatic audio post production web service for podcasts, broadcasters, radio shows, movies, screencasts and more.
  • SimonSays paid - Edit Video 5x Faster, Built For Teams.
  • Podcastle freemium - Studio-quality recording, AI-powered editing, and seamless exporting.
  • cleanvoice freemium - Removes filler sounds, stuttering and mouth sounds from your podcast or audio recording.
  • Super Hi-Fi enterprise - Artificial Intelligence Powered Music Experiences.

↑ Back to top

Hearing

  • Whisper.ai paid hardware - Smarter than your average hearing aid.
  • Eargo paid hardware - A Revolutionary New Hearing Aid.
  • Concha Labs hardware - Helping you hear more clearly.

↑ Back to top

Sound detection

  • Audio Analytic enterprise api - Creating exceptional human experiences through a greater sense of hearing.
  • SoundEye enterprise - Advanced sound recognition solutions capable of classifying sounds such as screaming, gunshot, coughing, and crying.
  • cochl api enterprise - A next-generation sound AI platform that understands any sounds like a human.
  • Josh.ai paid - A voice-controlled home automation system.
  • SEE SOUND paid - The world's first smart home hearing system.
  • Epigos.ai api - AI models that can be used to extract hidden data from audio sources.
  • HyperSurfaces enterprise - Seamlessly merging the physical and data worlds without the need for keyboards, buttons or touch screens.
  • HyperSentience enterprise - Delivers context awareness to phones, VR/AR headsets, smart watches, speakers and laptops.
  • Circulr Sound hardware - Smart audio wearables.
  • Securaxis enterprise - We turn sounds into information.
  • Deeply enterprise api - We add meaning to every sound in the world using advanced deep learning technology for sound event detection and context recognition.
  • Reef Pulse - Coral reef monitoring using bioacoustics and AI: sound event detection (boats, divers, waves, marine mammals, fishes, invertebrates) for impactful management of marine ecosystems.

↑ Back to top

Speech

Transcription

  • Ava freemium - Professional and AI-Based Captions for Deaf and HoH (Transcription & Diarization).
  • verbit enterprise - Professional AI-Based Transcription & Captioning.
  • otter freemium - Everything hybrid teams need for productive, collaborative meetings.
  • Trint paid - Audio Transcription Software — Speech to Text to Magic.
  • Rev paid - 99% accurate captions, transcripts, and subtitles.
  • voiceitt - An app for people with non-standard speech.
  • deepgram.com freemium api - Better voice applications with faster, more accurate transcription through AI Speech Recognition.
  • fireflies.ai freemium - AI assistant for your meetings.
  • SoapBox api enterprise - Speech technology that makes kids heard.
  • Amberscript freemium - SaaS solutions that automatically transform audio and video into text and subtitles using speech recognition.
  • Speaksee - Live captions what's being said during in-person group meetings.
  • Speechmatics api enterprise - Autonomous Speech Recognition technology that understands every voice.
  • sonix freemium - Automated transcription in 35+ languages.
  • Picovoice freemium api open-source - End-to-end Edge Voice AI, on-device voice recognition.
  • BoldVoice paid - Speak English clearly and confidently.
  • Gladia freemium api - Power your product with cutting-edge AI transcription, translation and audio intelligence using a single API.
  • Podsqueeze freemium - Re-purpose your audio or video podcast into transcript, show notes, blog post, video clips and other assets to publish and promote your show.

↑ Back to top

Synthesis (TTS)

  • adauris.ai freemium - Transforming written content into engaging audio with seamless distribution.
  • Aflorithmic paid api - Professional audio, voice, sound and music to scale.
  • Sonantic acquired (Acquired by Spotify) - Deliver compelling, lifelike performances with fully expressive AI-generated voices.
  • kroop AI - Harness synthetic media generation and detection with endless possibilities.
  • dubverse freemium - Make your content multilingual at a click of a button and reach more people.
  • Resemble.ai freemium api - Generate AI Voices that sound real.
  • Replica freemium - AI voice actors for games, film & the metaverse.
  • Respeecher paid - Voice Cloning for Content Creators.
  • amai - Ultra realistic text to speech voice engines.
  • AssemblyAI freemium api - Transcribe and understand audio with a single AI-powered API.
  • DAISYS - New voices that sound like real people.
  • WellSaid paid - Text-to-speech technology that creates life-like synthetic voices, from the voices of real people.
  • Deepsync - Generate audio content that exactly sounds like you.
  • coqui.ai open-source - Providing open speech tech for everyone.
  • Voiseed - AI-based Voice Engine able to mimic the emotions and prosody of human speech.
  • Speechki freemium - NLP-based text and audio editing platform with hundreds of AI voices inside.
  • Jellypod freemium - The AI podcast studio. Create customizable AI podcasts in minutes.
  • MiSynth - A brain-controlled instrument that uses synaptic technology and BCIs to turn imagined sounds into a synthesized MIDI instrument.
  • ElevenLabs freemium api - Developing the most compelling AI speech software for publishers and creators.
  • Wondercraft freemium - Wondercraft enables users to generate podcasts using Text-to-Speech technology.
  • play.ht freemium api - Building the future of content creation based on generative machine learning models.
  • Revocalize.ai freemium - Generate studio-quality AI Voices and train AI voice models from the web dashboard or the VST plugin.
  • morpheme.ai - Actor-First, Digital-Double Voices powered by the latest AI technology, ensuring they are efficient, authentic, and ethical.

↑ Back to top

Enhancement & Manipulation

  • Meaning - Streaming real-time voice and accent conversion.
  • VideoDubber freemium - Translating video/audio through voice cloning and accent conversion in 150+ languages.
  • krisp freemium - An AI-powered software solution for effective online meetings.
  • voicemod freemium - Free real-time voice changer.
  • audo freemium api - Noise cancellation products for creators, developers, and virtual meetings.
  • AudioTelligence enterprise api - Software that transforms the clarity and intelligibility of speech in challenging acoustic environments.
  • immersitech.io enterprise - We don't make audio. We make audio better.
  • utterly freemium - Noise removal for meetings and audio.
  • claerity.ai freemium - Cutting-edge AI to eliminate all background noise on video conference calls.
  • Neural Love freemium - Set of AI-powered tools to enhance audio quality.
  • HeardThat freemium - A smartphone app that turns your smartphone into a sophisticated speech-enhancement device.
  • Chatable freemium - A smartphone app that removes disruptive background noise.
  • BdSound enterprise - Intelligent Audio Solution for audio and voice-enabled products.
  • echosonic - Revolutionizing microphone by bringing Machine Learning capabilities into it.
  • Insoundz freemium - Generative AI Audio Enhancement.
  • Xound freemium - AI-powered audio enhancements in just one click. Grammarly for audio.

Development

Tools & SDKs

  • Quilio api - We maintain tools to help developers build real-time audio AI applications with ease.

Contributing

Fork the repo, edit the README, and open a PR.

Contributors

↑ Back to top