8000 GitHub - Gilliaaan/KaraFan: The BEST music separation model with help of A.I. ... to my ears ! πŸ‘‚πŸ‘‚
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

The BEST music separation model with help of A.I. ... to my ears ! πŸ‘‚πŸ‘‚

License

Notifications You must be signed in to change notification settings

Gilliaaan/KaraFan

Β 
Β 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

50 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🎡 KaraFan

The BEST music separation software with help of A.I. ... to my ears ! πŸ‘‚πŸ‘‚

As you've guessed, it was made specially for Karaoke (▢️ focus on musical part).
This project is open to all goodwill.

The Discord community, Chat-GPT & GitHub Copilot ... and my human intelligence made this project possible.

KaraFan

KaraFan works with Google Colab or if you have a decent GPU, on Your PC with Visual Studio Code or with a command line in a shell !!

  I'm in love with Copilot ! 😍

πŸ’² DONATE

If you want to encourage me to give more time to improve this project :

Β 
Β 

You can become one of my patrons :

Patreon

And if you have some Bitcoins (or cryptos) that you don't know what to do with,
click here to send me an email, I'll give you my wallet address :

Bitcoin 😍

You can reach me by email or Join all of us on Discord this Discord server !

πŸ“– Table of Contents

🧻 in the WIKI now

So for now, in addition to being obliged to R.T.F.R.M (Read The Fucking READ ME), you have to R.T.F.W (Read The Fucking WIKI) !! πŸ˜›

~


🎰 SUCCESS STORY

This 12th of September 2023, I've receveid my first compliment from one of KaraFan's users : SCRFilms (on Discord) :

It's weird why Source Separation Ratio (SSR) doesn't exist, it is also a big part of defining quality.
Like how well the model separates from it's original signal, how often it false detects other instruments, and stuffs.
I can tell not all model are the same and have different SSRs,
but tbh, if talking about source separation ratio, the KaraFan wins from ALL models
Salute.
...
eyyy, I find it cool tho where it actually mute the signal when there is no vocals detected.
MDX23C 8k FFT usually have around -70dB separation ratio, but for KaraFan, it's beyond 120dB or in short mutes the signal which is insane !

We were talking about SDR score that define the quality of the separation.
... and Β« MDX23C 8k FFT Β» was actually the BEST model with high SDR score !!

But ... I'm only talking aboout HIFI quality for my ears ... once again.

And also :

I answered Bas Curtiz (who made the SDR tests, thanks to him !) in private message :

I could trick the SDR algo if I would πŸ˜‰

Remember all of you : I'm a (white hat) hacker ...


πŸ”₯ INTRODUCTION

I'm an amateur Rock singer who has often been disappointed by the inability to find songs of my favorite singers in the vast KAR databases I possess, or in the extensive library of thousands of songs offered by Β« KaraFun Β» (with a paid subscription).

Of course, you have all the singers's best-of, but it's not always the songs you want to sing.

And if the singer is not very famous, you'll find 3-4 songs with real musicians playing or you will have to sing on a MIDI file, which is not very pleasant (even with "Sound Fonts"), or you can forget it ... until ...

πŸ’€ AN OLD DREAM

Since my childhood, I sing everytime and I've been dreaming of a software that would allow me to remove the voice of a song to sing on it.

TODAY, I'M 54 YEARS OLD, I'M STILL SINGING, AND MY DREAM HAS COME TRUE !

Now, I have the ability to create my own Β« KFN Β» files for my favorite songs, featuring real musicians who played on the original track, and use them with my beloved software : Β« KaraFun Β» πŸŽ€πŸ’‹

KaraFun is a free software that you can use to play Karaoke files on your PC.
They have subscription plans to access their huge library of songs, but you can also use for free with your own MID, KAR, CDG & KFN files !!

Perhaps it's time to turn on your dreams too ... ?

πŸ§’ BIOGRAPHY

Programmer since the age of 12 (1981).
Before, I developed in ASM, C, C++, Basic, Visual Basic.
Since the year 2000, I have been coding in PHP, MySQL, JavaScript, jQuery, HTML, CSS.

And today in Python. ❀️

~


πŸ“† HISTORY

I first started with the Demucs facebook research model, but I was disappointed with the results, especially with the instrumental part.

Then I discovered the MDX model, and I was amazed by the quality of the results, especially with the vocals and the instrumental part.

But I was still disappointed with the instrumental part, which was not shining enough for me.

So I decided to create my own process, based on the MDX models, but with the best instrumental that I could get.
(Remembers : It's for Karaoke !).

I also added a few tricks to improve the quality of the results.

πŸ“ˆ CHANGES LOG

Date Version Description
2023-09-15 1.4 Totally Rewamped INSTALLATION
2023-09-09 1.3 Correct "Linkwitz-Riley" filter (better SRS)
New download system for Models (stay in GUI)
Add TEST MODE (process with only 1 pass)
Add "Delete" buttons (works with ONE file and DEBUG & GOD_MODE activated)
2023-09-08 1.2 Possibility to choose any MDX A.I. models for Instrum & Vocals
New download system for Models (stay in GUI)
Saving MP3 with fucking Cut-OFF @ 16 Khz solved !
2023-09-06 1.1 Bugs Fixed with the returns of Discord's people
One can choose the ouput directory
KaraFan have Automagic Updates system !
2023-09-05 1.0 First version dropped on Discord (buggy)
2023-09-03 0.8 Post the first R.T.F.R.M. (Read The Fucking "READ ME") on GitHub
2023-08-19 0.5 - Inform people on Discord that I'm working on a it...
Β« KaraFan Β» was born ! πŸŽ‰πŸŽ‰πŸŽ‰
2023-08-03 0.1 I started to dig infos on Discord to find the best models & process to use
(follow this link if you wanna read all the story since the beginning ...)
2023-07-12 0.0 I found & tried the wonderful Facebook Demucs models, and the idea started to grow in my mind ...

~


πŸ¦„ NEAR FUTURE

You will see ... I have hundreds of ideas, but I need time to implement them.

And as I'm an independent developer, I have to work to earn my living.

So if you want to encourage me to give more time to improve this project,
you can make a donation or become one of my patrons :

🎡 Go back to the top of this page 🎡 where you can find the donation buttons.

This helps me to :

  • Pay calculation time on Google Colab (100 units = 11 €, I eat all in 3-4 days)
  • Perhaps, buy new hardware to make more and more tests to improve the quality of the results.
    I'm currently using a 4 years old laptop with a GTX 1060 Ti 4GB
    I'm dreaming of a RTX 4090 ... and I'm NOT a gamer !
    ... and if you are too generous, Me can buy an Nvidia A100 40GB !! 😍

~


😍 WALL OF FAME

My beloved Patrons & Donators :

  • Your name here !!

~


πŸ“ TODO LIST

My brain is overheating ! 😡

  • Add more MDX Models.
  • Add a TEST option to quickly do only 1 pass on audio files for testing purposes.
  • Add a Β« STOP Β» button to kill Colab background processes that consume your credits !!
  • Get weak voices with A.I. models with a pre-amplification of low parts of the spectrum.
  • Automatic fine-tuning of volume compensation for each song (am I Crazy ??).
  • Separate Choirs from Vocals (very hard challenge !)
  • etc ... etc ... etc ...

~


πŸ’— SPECIAL THANKS TO...

All these wonderful people without whom this project would not have been possible :

βœ”οΈ Jarredou - For his multiples Colab sharing, for his really helpful hand on Discord, and from whom I've stolen ALL the code of inference.py to create my own GUI with it !! πŸ‘Ό

βœ”οΈ Deton 24 - For helping hand and for his very BIG documentation about Music Separation, that you can find HERE ! Read it !!
βœ”οΈ Bas Curtiz - For his Youtube video : Tips & Tricks - Inverting that confort my FIRST intuition (after thousands tests πŸ˜›) about the importance of cascading the models to improve the quality of the results.
βœ”οΈ Roman Solovyev (aka ZFTurbo) - For sharing his code that serve as base to Jarredou & Deton 24 to create their own code.
And it's also the creator of MVSEP.com, where you can follow the SDR evolution of the MDX models : Quality Checker !

βœ”οΈ Kuielab - Developed the original MDX-Net AI code
βœ”οΈ Anjok07 & DilanBoskan - Authors of Ultimate Vocal Remover GUI from where ZFTurbo has stolen some parts of code to create his own... πŸ™„
βœ”οΈ Kimberley Jensen - Training the MDX-Net AI models, especially Β« Kim Vocal 2 Β» (that won the 3rd place in the Β« MDX-C 2022 Β» challenge)
βœ”οΈ Facebook Research Demucs - For giving me the first steps & the desire to start this project ...

We are all standing on the shoulders of giants !

Thousands Thanks to all of you ! I love you ! ❀️


πŸ“œ LICENSE

Β© Captain FLAM & Jarredou - 2023 - M.I.T license

That means you can do whatever you want with this code, but YOU HAVE TO mention my name and the fact that I'm the original author of this code, and mention the names of all the people who have contributed to this project.

YOU HAVE TO keep the original license file in your project, and keep the original header with copyrights in each source file.


About

The BEST music separation model with help of A.I. ... to my ears ! πŸ‘‚πŸ‘‚

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 93.5%
  • Jupyter Notebook 6.1%
  • Batchfile 0.4%
0