weiyan-shi

🫨

Nonsense

Weiyan Shi weiyan-shi

🫨

Nonsense

23 followers · 16 following

Singapore University of Technology and Design
Singapore
14:06 (UTC +08:00)
https://weiyan-shi.github.io
in/shiweiyan

Achievements

Organizations

Lists (1)

Sort

weiyan's starred repos

Stars

OmniSVG / OmniSVG

OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from simple icons to in…

Python 1,791 52 Updated May 26, 2025

lllyasviel / ControlNet

Let us control diffusion models!

Python 32,599 2,913 Updated Feb 25, 2024

excalidraw / excalidraw

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 102,393 10,106 Updated Jun 23, 2025

Yushi-Hu / VisualSketchpad

Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

Jupyter Notebook 230 12 Updated Oct 28, 2024

stair-lab / kg-gen

Knowledge Graph Generation from Any Text

Python 507 60 Updated Jun 5, 2025

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 22,864 2,526 Updated Aug 12, 2024

yaoyao-liu / minimal-light

A simple and elegant Jekyll theme for an academic personal homepage

CSS 819 710 Updated Apr 9, 2025

Chi-Loong / IDV

Interactive Data Visualization SUTD course

HTML 5 1 Updated May 5, 2025

googlecreativelab / quickdraw-dataset

Documentation on how to access and use the Quick, Draw! Dataset.

6,427 998 Updated Mar 11, 2025

magenta / magenta

Magenta: Music and Art Generation with Machine Intelligence

Python 19,531 3,784 Updated May 6, 2025

wangqiang9 / DiffSketching

About PyTorch implementation of DiffSketching: Sketch Control Image Synthesis with Diffusion Models, BMVC 2022

Python 33 4 Updated May 18, 2023

SihaoDong / SGID

5 Updated Feb 10, 2025

WenyanLiu / CCFrank4dblp

Displays the China Computer Federation (CCF) recommended rank of international conferences and journals in the dblp, Google Scholar, Connected Papers and and Web of Science search results.

JavaScript 703 51 Updated Dec 4, 2024

OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,925 112 Updated Jun 16, 2025

ultralytics / yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 54,332 17,006 Updated Jun 21, 2025

zgchen33 / MCGaze

[IEEE SPL] End-to-end Video Gaze Estimation via Capturing Head-face-eye Spatial-temporal Interaction Context

Python 65 13 Updated Mar 18, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 16,384 1,753 Updated Jun 8, 2025

huggingface / datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python 20,292 2,851 Updated Jun 23, 2025

jiahaoli57 / Call-for-Reviewers

This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals

880 35 Updated Jun 22, 2025

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 22,314 2,636 Updated Apr 30, 2025

lixin4ever / Conference-Acceptance-Rate

Acceptance rates for the major AI conferences

Jupyter Notebook 4,518 311 Updated Jan 24, 2025

e-apostolidis / PGL-SUM

A PyTorch Implementation of PGL-SUM from "Combining Global and Local Attention with Positional Encoding for Video Summarization" (IEEE ISM 2021)

Python 89 35 Updated Jan 30, 2023

IntelLabs / GraVi-T

Graph learning framework for long-term video understanding

Python 63 9 Updated Jun 13, 2025

iangohy / SUTD-info

Useful links for SUTDents!

7 Updated Nov 7, 2021

KaiyangZhou / pytorch-vsumm-reinforce

Unsupervised video summarization with deep reinforcement learning (AAAI'18)

Python 494 150 Updated Dec 11, 2023

KaiyangZhou / vsumm-reinforce

AAAI 2018 - Unsupervised video summarization with deep reinforcement learning (Theano)

Python 140 35 Updated Nov 30, 2021

CMU-Perceptual-Computing-Lab / openpose

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

C++ 32,669 7,979 Updated Aug 3, 2024

tylin / coco-caption

Jupyter Notebook 1,187 548 Updated May 13, 2024

ranjaykrishna / densevid_eval

Evaluation code for Dense-Captioning Events in Videos

Python 128 46 Updated Jun 11, 2019

mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,388 114 Updated Mar 29, 2025