samulenzz / Starred · GitHub
Python 6,190 409 Updated May 21, 2025

A GUI agent application based on UI-TARS (a vision-language model) that allows you to control your computer using natural language.

TypeScript 14,201 1,171 Updated May 25, 2025

An open-sourced end-to-end VLM-based GUI Agent

Python 953 74 Updated Apr 4, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,057 35 Updated May 21, 2025

AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.

Python 708 68 Updated May 21, 2025
Python 2,928 263 Updated May 23, 2025

Examples and guides for using the Gemini API

Jupyter Notebook 13,063 1,757 Updated May 25, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 22,180 1,863 Updated Mar 26, 2025

💖🧸 A container of souls of AI waifu / virtual characters to bring them into our worlds, wishing to achieve Neuro-sama's altitude, completely LLM and AI driven, capable of realtime voice chat, Minec…

Vue 811 65 Updated May 25, 2025

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

JavaScript 36,571 2,296 Updated Apr 21, 2025

In-depth tutorials on LLMs, RAGs and real-world AI agent applications.

Jupyter Notebook 9,292 1,591 Updated May 24, 2025

Staging repo for development of native port of TypeScript

Go 20,093 621 Updated May 25, 2025

A lightweight tool for macOS that smooths mouse scrolling and lets you set the scroll direction independently, making your scroll wheel feel as smooth as a trackpad.

Swift 16,743 551 Updated May 15, 2025

Master programming by recreating your favorite technologies from scratch.

Markdown 381,946 35,584 Updated Apr 11, 2025

Vision agent

Python 4,749 536 Updated May 19, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 141,708 11,867 Updated May 25, 2025

🎉 Bridge of iOS Devices by usbmuxd. An iOS debugging tool based on usbmuxd.

Go 1,066 248 Updated Aug 27, 2024

Wrapper for pymobiledevice3 that makes it easier to use.

Python 243 42 Updated Jul 2, 2024

Facebook WebDriverAgent Python Client Library (not official)

Python 1,797 279 Updated Dec 4, 2024

tidevice can be used to communicate with iPhone devices.

Python 2,493 463 Updated Sep 20, 2024

An invoice generator app built using Next.js, TypeScript, and Shadcn

TypeScript 5,296 543 Updated May 25, 2025

SOTA Open Source TTS

Python 21,206 1,699 Updated Apr 12, 2025

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

TypeScript 34,905 3,337 Updated May 23, 2025

iOS Minicap provides a socket interface for streaming realtime screen capture data out of iOS devices.

C++ 442 131 Updated Aug 15, 2020

Instructions for mirroring iOS device on web browser

9 1 Updated Jan 16, 2019

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…

TypeScript 61,401 12,853 Updated May 26, 2025

📱 Display and control your Android device graphically with scrcpy.

JavaScript 5,074 375 Updated May 20, 2025

Android phone in browser with WebUSB + Adb

Vue 40 10 Updated Nov 4, 2022

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,477 1,153 Updated May 24, 2025

📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools lik…

TypeScript 16,213 704 Updated May 25, 2025