- Http://talkinginterfaces.org
- @Mhakkinen
- @mhakkinen@mastodon.social
- in/mhakkinen
Highlights
- Pro
Stars
Real-time webcam demo with SmolVLM and llama.cpp server
Talking Head (3D): A JavaScript class for real-time lip-sync using Ready Player Me full-body 3D avatars.
This project is a digital human that can talk and listen to you. It uses OpenAI's GPT to generate responses, OpenAI's Whisper to transcript the audio, Eleven Labs to generate voice and Rhubarb Lip …
A migraine tracker, built with Mavo. Work in progress, come back later.
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
Framework and toolkits for building and evaluating collaborative agents that can work together with humans.
A simple screen parsing tool towards pure vision based GUI agent
Official code of "MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation"
IBM Equal Access Accessibility Checker contains tools to automate accessibility checking from a browser or in a continuous development/build environment
Production-ready platform for agentic workflow development.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
The web framework for content-driven websites. ⭐️ Star to support our work!
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
The amp-up.io QTI 3 Stimulus Player Component ("QTI 3 Stimulus Player") is a 100% JavaScript component that aims to encapsulate the best practices and behaviors of the IMS Global QTI 3 Assessment S…
Home of the WebKit project, the browser engine used by Safari, Mail, App Store and many other applications on macOS, iOS and Linux.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Include is a tool built to make annotating for accessibility (a11y) easier—easier for designers to spec and easier for developers to understand what is required.
The qti3-item-player component ("QTI 3 Player") is a 100% JavaScript component that aims to encapsulate the best practices and behaviors of the IMS Global QTI 3 Assessment Item specification.
Contains documentation for Vispero software support of Web standards
team-strat, on GitHub, working in public. Current state: DRAFT
OpenACR is a digital native Accessibility Conformance Report (ACR). The initial development is based on Section 508 requirements. The main goal is to be able to compare the accessibility claims of …
Beautiful and accessible drag and drop for lists with React
A collection of pre-built speech synthesis settings used to convey emotion