8000 turned2670 (Jimmy) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View turned2670's full-sized avatar
  • School of Software
  • Tsinghua University

Highlights

  • Pro

Block or report turned2670

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fully open reproduction of DeepSeek-R1

Python 24,521 2,256 Updated May 22, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 6,578 553 Updated Apr 19, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 8,153 619 Updated Apr 27, 2025

[ICASSP 2024] The official repo for Harnessing the Power of Large Vision Language Models for Synthetic Image Detection

Python 29 2 Updated Jan 18, 2025
Python 915 68 Updated May 22, 2024
Python 27 6 Updated Jul 30, 2024

Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".

Python 660 67 Updated Sep 19, 2024

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,925 324 Updated Jun 12, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 15,681 1,482 Updated Jan 19, 2025

Command-line program to download videos from YouTube.com and other video sites

Python 135,756 10,334 Updated May 4, 2025

Hate-CLIPper: Multimodal Hateful Meme Classification with Explicit Cross-modal Interaction of CLIP features - Accepted at EMNLP 2022 Workshop

Jupyter Notebook 52 11 Updated Apr 15, 2025

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 5,209 497 Updated Aug 6, 2024

Memes Processing Pipeline that enables the track of memes across multiple Web communities.

Python 57 18 Updated Mar 9, 2020
8000
Jupyter Notebook 6 Updated Jul 7, 2023

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,442 6,021 Updated May 21, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 22,585 2,491 Updated Aug 12, 2024

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,480 183 Updated Apr 2, 2025

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Python 282 8 Updated Nov 13, 2024
Python 87 13 Updated Jul 4, 2024

✨✨Latest Advances on Multimodal Large Language Models

15,272 987 Updated May 15, 2025

"他山之石、可以攻玉":复旦白泽智能发布面向国内开源和国外商用大模型的Demo数据集JADE-DB

Jupyter Notebook 408 24 Updated Mar 14, 2025

Stable Diffusion web UI

Python 152,732 28,412 Updated May 3, 2025
Python 13 Updated Jul 26, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 29,070 3,592 Updated Jul 23, 2024

Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge. https://arxiv.org/abs/2012.12975

Jupyter Notebook 60 19 Updated Feb 12, 2024
Next
0