8000 GitHub - xionxiao/kantv: study and practise state-of-the-art AI technology in real application/real scenario on Android device, powered by great multimedia project FFmpeg & great AI project Whisper.cpp
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
/ kantv Public
forked from kantv-ai/kantv

study and practise state-of-the-art AI technology in real application/real scenario on Android device, powered by great multimedia project FFmpeg & great AI project Whisper.cpp

License

Notifications You must be signed in to change notification settings

xionxiao/kantv

< 8000 div class="OverviewContent-module__Box_6--wV7Tw">
 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

KanTV

KanTV("Kan", aka English "watch") , an open source project focus on study and practise on-device AI technology in real scenario(such as perform online-TV playback and realtime transcription and online-TV record at the same time) on Android phone:

Software architecture of KanTV Android

Image

Building the project

Run Android APK on Android phone

  • Android 8.0(2017.08) --- Android 15(2024.10) and higher version with ANY mainstream arm64 mobile SoC.
  • Android smartphone equipped with ANY mainstream high-end mobile SoC is highly recommented for realtime AI-subtitle feature otherwise unexpected behavior would happen.
  • Android smartphone equipped with one of below Qualcomm mobile SoCs(Qualcomm's state-of-the-art high-end mobile SoC Snapdragon 8Gen3 series and Snapdragon 8Elite series are highly recommended) is required for verify/running ggml-hexagon backend on Android phone:
    Snapdragon 8 Gen 1
    Snapdragon 8 Gen 1+
    Snapdragon 8 Gen 2
    Snapdragon 8 Gen 3
    Snapdragon 8 Elite

Screenshots

here is a short video to demostrate realtime AI subtitle by running the great & excellent & amazing whisper.cpp on an Android phone equipped with Qualcomm Snapdragon 8Gen3 mobile SoC - fully offline, on-device.

realtime-subtitle-by-whispercpp-demo-on-xiaomi14-finetune-20240324.mp4

a screenshot to demostrate multi-modal inference by running the magic llama.cpp on an Android phone equipped with Qualcomm Snapdragon 8Elite mobile SoC - fully offline, on-device.

Image


a screenshot to demostrate realtime-video-recognition via MTMD from llama.cpp + a lightweight multimodal model SmolVLM2-256M from Huggingface on an Android phone equipped with Qualcomm Snapdragon 8Elite mobile SoC - fully offline, on-device.

Image

some other screenshots

    Image

    Image


    a screenshot to demostrate ASR inference by running the excellent whisper.cpp on an Android phone equipped with Qualcomm Snapdragon 8Gen3 mobile SoC - fully offline, on-device.

    Image


    a screenshot to demostrate Text-2-Image inference by running the amazaing stable-diffusion.cpp on an Android phone equipped with Qualcomm Snapdragon 8Elite mobile SoC - fully offline, on-divice.

    713992135


    a screenshot to demostrate download LLM model in APK.

    1213951738 1242080159

Docs

Contribution

Report issue in Android phone equipped with mainstream mobile SoC or submit PR to this project is greatly welcomed.

We use GitHub issues for tracking feature requests and issue reports, please see how to submit issue in this project .

Special Acknowledgement

    AI inference framework

About

study and practise state-of-the-art AI technology in real application/real scenario on Android device, powered by great multimedia project FFmpeg & great AI project Whisper.cpp

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 55.6%
  • C 22.1%
  • Java 21.9%
  • CMake 0.2%
  • GLSL 0.1%
  • Shell 0.1%
0