Luckydog-lhy (lhy) / Starred · GitHub

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,129 41 Updated May 21, 2025

Redis for LLMs

Python 1,239 183 Updated Jun 2, 2025

recursive rag with r1 reasoning

Python 307 38 Updated May 21, 2025

A collection of classic and cutting-edge industry papers on recommendation, advertising, and search.

Python 1,779 241 Updated May 27, 2025

The official implementation of RAR

Python 88 1 Updated Mar 27, 2024

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

876 40 Updated Apr 20, 2025
Python 9 1 Updated May 23, 2025

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Python 278 14 Updated Oct 7, 2024

QwQ is the reasoning model series developed by Qwen team, Alibaba Cloud.

Python 501 18 Updated Mar 27, 2025

Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced Reranking and Noise-injected Training.

Python 77 5 Updated Nov 15, 2024
Python 455 36 Updated May 29, 2025

Parsing-free RAG supported by VLMs

Python 722 57 Updated Feb 19, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 54,115 5,220 Updated May 30, 2025

Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision

164 4 Updated Apr 30, 2025

This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search".

Python 18 1 Updated Mar 2, 2025

ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents

Python 479 36 Updated Mar 20, 2025
Jupyter Notebook 194 12 Updated Jul 5, 2024

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,859 1,749 Updated Feb 26, 2025

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,490 122 Updated Apr 17, 2024

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'

Jupyter Notebook 1,933 82 Updated May 21, 2025
Python 4 Updated Feb 28, 2025

A journey to a real multimodal R1! We are running large-scale experiments.

Python 305 8 Updated May 15, 2025

A fork to add multimodal model training to open-r1

Python 1,282 61 Updated Feb 8, 2025

s1: Simple test-time scaling

Python 6,419 749 Updated May 19, 2025

Train transformer language models with reinforcement learning.

Python 14,014 1,929 Updated Jun 2, 2025

Fully open reproduction of DeepSeek-R1

Python 24,643 2,280 Updated Jun 2, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,845 1,490 Updated Apr 24, 2025

Fast and memory-efficient exact attention

Python 17,633 1,716 Updated Jun 2, 2025