10000 weiyan-shi (Weiyan Shi) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View weiyan-shi's full-sized avatar
🫨
Nonsense
🫨
Nonsense

Organizations

@Cheer-for-fun

Block or report weiyan-shi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from simple icons to in…

Python 1,791 52 Updated May 26, 2025

Let us control diffusion models!

Python 32,599 2,913 Updated Feb 25, 2024

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 102,393 10,106 Updated Jun 23, 2025

Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

Jupyter Notebook 230 12 Updated Oct 28, 2024

Knowledge Graph Generation from Any Text

Python 507 60 Updated Jun 5, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 22,864 2,526 Updated Aug 12, 2024

A simple and elegant Jekyll theme for an academic personal homepage

CSS 819 710 Updated Apr 9, 2025

Interactive Data Visualization SUTD course

HTML 5 1 Updated May 5, 2025

Documentation on how to access and use the Quick, Draw! Dataset.

6,427 998 Updated Mar 11, 2025

Magenta: Music and Art Generation with Machine Intelligence

Python 19,531 3,784 Updated May 6, 2025

About PyTorch implementation of DiffSketching: Sketch Control Image Synthesis with Diffusion Models, BMVC 2022

Python 33 4 Updated May 18, 2023
5 Updated Feb 10, 2025

Displays the China Computer Federation (CCF) recommended rank of international conferences and journals in the dblp, Google Scholar, Connected Papers and and Web of Science search results.

JavaScript 703 51 Updated Dec 4, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,925 112 Updated Jun 16, 2025

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 54,332 17,006 Updated Jun 21, 2025

[IEEE SPL] End-to-end Video Gaze Estimation via Capturing Head-face-eye Spatial-temporal Interaction Context

Python 65 13 Updated Mar 18, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 16,384 1,753 Updated Jun 8, 2025

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python 20,292 2,851 Updated Jun 23, 2025

This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals

880 35 Updated Jun 22, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 22,314 2,636 Updated Apr 30, 2025

Acceptance rates for the major AI conferences

Jupyter Notebook 4,518 311 Updated Jan 24, 2025

A PyTorch Implementation of PGL-SUM from "Combining Global and Local Attention with Positional Encoding for Video Summarization" (IEEE ISM 2021)

Python 89 35 Updated Jan 30, 2023

Graph learning framework for long-term video understanding

Python 63 9 Updated Jun 13, 2025

Useful links for SUTDents!

7 Updated Nov 7, 2021

Unsupervised video summarization with deep reinforcement learning (AAAI'18)

Python 494 150 Updated Dec 11, 2023

AAAI 2018 - Unsupervised video summarization with deep reinforcement learning (Theano)

Python 140 35 Updated Nov 30, 2021

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

C++ 32,669 7,979 Updated Aug 3, 2024
Jupyter Notebook 1,187 548 Updated May 13, 2024

Evaluation code for Dense-Captioning Events in Videos

Python 128 46 Updated Jun 11, 2019

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,388 114 Updated Mar 29, 2025
Next
0