8000 liweijia (Weijia Li) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View liweijia's full-sized avatar

Block or report liweijia

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation of 'DynamicEarthNet: Daily Multi-Spectral Satellite Dataset for Semantic Change Segmentation' [CVPR 2022]

Python 39 4 Updated Oct 12, 2022

Implementation of 'Coming Down to Earth: Satellite-to-Street View Synthesis for Geo-Localization' [CVPR 2021]

Python 52 4 Updated Jul 26, 2022

Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 2,720 177 Updated May 15, 2025

Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"

Python 54 Updated Apr 30, 2025

Official PyTorch implementation of SA-Occ: Satellite-Assisted 3D Occupancy Prediction in Real World

Python 29 4 Updated May 19, 2025

✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Python 124 8 Updated Mar 4, 2025

GPT-ImgEval: Evaluating GPT-4o’s state-of-the-art image generation capabilities

Python 281 4 Updated May 3, 2025

The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"

Python 38 3 Updated Jun 11, 2025

FakeVLM: Advancing Synthetic Image Detection through Explainable Multimodal Models and Fine-Grained Artifact Analysis

Python 54 2 Updated Jun 11, 2025

This repo collects research papers that use AI tools and are in the field of scientific research (including computer science, agronomy, chemistry, physics, etc.). We call this method as Deep-Research.

95 9 Updated Mar 17, 2025

The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.

Shell 55 3 Updated Feb 15, 2025

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 1,368 110 Updated Apr 14, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,910 1,757 Updated Feb 26, 2025

Code for "Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation"

Jupyter Notebook 87 7 Updated May 19, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,389 2,237 Updated Feb 1, 2025

A C++/Python implementation of the StreetLearn environment based on images from Street View, as well as a TensorFlow implementation of goal-driven navigation agents solving the task published in “L…

C++ 310 63 Updated Jul 21, 2020

Multimodal Large Language Models for Remote Sensing (RS-MLLMs): A Survey

263 10 Updated Jun 20, 2025

[AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios”

Python 30 5 Updated Apr 10, 2025

Awesome-Remote-Sensing-Vision-Language-Models

171 10 Updated Apr 27, 2024

The official pytorch implementation of Exploring the Interactive Guidance for Unified and Effective Image Matting [Arxiv]

Python 23 Updated Mar 29, 2024

The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”

Python 51 2 Updated Mar 18, 2025

✨✨Latest Advances on Multimodal Large Language Models

15,608 1,015 Updated Jun 19, 2025

[ICLR 2025 Spotlight] The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”

Python 152 4 Updated Mar 31, 2025

This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.

43 Updated Dec 3, 2024

Awesome lists about framework figures in papers

820 22 Updated Jun 3, 2025

[ECCV 2024] About The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network“.

Python 83 5 Updated Apr 29, 2025

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 36,407 2,960 Updated Jun 23, 2025

Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"

Python 332 59 Updated Jun 2, 2024
Next
0