HAWLYQ (Anwen Hu) / Starred · GitHub

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 787 12 Updated May 15, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 25,015 2,225 Updated May 16, 2025

A paper list of recent works on token compression for ViT and VLM

460 22 Updated May 15, 2025

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 2,285 198 Updated May 12, 2025

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 2,172 127 Updated Dec 24, 2024

[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Python 440 51 Updated Jan 28, 2025

The official Meta Llama 3 GitHub site

Python 28,700 3,376 Updated Jan 26, 2025
Python 221 25 Updated Apr 23, 2024

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 4,224 422 Updated Apr 10, 2025

mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating

Python 94 2 Updated Jan 29, 2024

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 5,812 641 Updated Mar 19, 2025
Python 134 9 Updated Feb 13, 2024
Python 25 2 Updated Oct 8, 2023

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Python 10,361 972 Updated May 12, 2025
Python 14 Updated Sep 5, 2023

AutoBangumi - A fully automatic anime-following tool

Python 7,487 392 Updated Apr 29, 2025

Chat凉宫春日, an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Jupyter Notebook 1,970 177 Updated Aug 13, 2024

YuLan: An Open-Source Large Language Model

Python 623 58 Updated Jan 10, 2025

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

Python 296 11 Updated Jan 8, 2024

Narrative movie understanding benchmark

Python 70 Updated May 9, 2024

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)

Python 92 8 Updated May 8, 2023

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,478 183 Updated Apr 2, 2025

A Chinese Open-Domain Dialogue System

Python 321 27 Updated Aug 16, 2023

[NIPS2023] RRHF & Wombat

Python 808 47 Updated Sep 22, 2023

Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).

Python 182 13 Updated Jun 27, 2023

A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.

Jupyter Notebook 2,492 244 Updated May 6, 2025

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 5,246 690 Updated Aug 5, 2024

Programmer's guide to cooking at home (Simplified Chinese only).

Dockerfile 84,604 9,941 Updated May 13, 2025
15 Updated Apr 8, 2022

ICECAP code

Python 4 Updated Jul 8, 2021