Nayuta403 / Starred · GitHub
🎯
Focusing

Organizations

@bytedance @LianjiaTech @cfug @fluttercandies


🐱 BongoCat, a cross-platform desktop pet that adds fun to your desktop!

TypeScript 5,721 269 Updated Jun 1, 2025

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 4,286 431 Updated Apr 10, 2025
Python 6,288 414 Updated May 21, 2025

Brings the iOS scrolling experience to Android.

Java 132 5 Updated Sep 16, 2023

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 6,600 555 Updated Apr 19, 2025

The model, data and code for the visual GUI Agent SeeClick

HTML 378 18 Updated Nov 22, 2024

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Python 1,270 86 Updated May 29, 2025

A high-performance, pure-computation 2D rigid-body physics engine

TypeScript 77 13 Updated Mar 20, 2022

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

53,759 5,709 Updated Jun 1, 2025

Towards Large Multimodal Models as Visual Foundation Agents

Python 216 6 Updated Apr 24, 2025

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 388 28 Updated Apr 30, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 22,284 1,872 Updated Mar 26, 2025

🔥🔥 btrace (AKA RheaTrace) is a high-performance Android trace tool based on Perfetto. It supports defining custom events automatically while building the APK and uses bhook to provide more n…

Kotlin 2,021 283 Updated May 15, 2025

VisionTasker introduces a novel two-stage framework combining vision-based UI understanding and LLM task planning for mobile task automation in a step-by-step manner.

Python 76 11 Updated Feb 17, 2025

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

703 45 Updated Jun 1, 2025

AndroidWorld is an environment and benchmark for autonomous agents

Python 323 45 Updated May 22, 2025

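🔥 An Android AccessibilityService development framework and Android automation scripting framework for quickly building complex automation tasks, remote assistance, monitoring, and more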

Kotlin 509 146 Updated May 30, 2025

Vreo (short for VR Video) is a Realsee 3D space script player built on the Realsee 3D rendering engine Five and the UI library React.

TypeScript 33 9 Updated May 30, 2025
HTML 5 2 Updated Apr 9, 2024

An Android technology middle platform: may we all live long, and may the grunt work be no more

Java 6,642 1,386 Updated Sep 10, 2022

An input component for controlling your app in natural language using an LLM through LangChain.dart

Dart 13 4 Updated Nov 1, 2024

A state-of-the-art open visual language model | multimodal pretrained model

Python 6,562 429 Updated May 29, 2024

Paper list for Personal LLM Agents

388 20 Updated May 8, 2024

Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"

Python 359 52 Updated Mar 22, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 5,291 691 Updated Aug 5, 2024

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 5,851 646 Updated Mar 19, 2025

Modular and customizable Material Design UI components for Android

Java 16,795 3,138 Updated May 30, 2025

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,673 695 Updated Jun 2, 2025

Real-Time audio processing library written in Dart.

C 112 15 Updated Jul 18, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 108,646 17,688 Updated Jun 2, 2025