brian7685 / Starred · GitHub

brian7685

7 followers · 2 following

Stars

FocoosAI / focoos

🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.

Python 332 1 Updated Jul 6, 2025

VITA-MLLM / VITA

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,349 172 Updated Mar 28, 2025

jiwoogit / StyleID

[CVPR 2024 Highlight] Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer

Python 394 35 Updated Dec 16, 2024

FreeStyleFreeLunch / FreeStyle

FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models

Python 123 10 Updated May 21, 2024

facebookresearch / VLaMP

Code for “Pretrained Language Models as Visual Planners for Human Assistance”

Python 61 2 Updated Jun 12, 2023

wjun0830 / CGDETR

Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"

Python 133 16 Updated Aug 21, 2024

boheumd / A2Summ

The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)

Python 76 9 Updated Apr 24, 2023

GAIA-AIDA / object-detection

Object Detection component developed for the DARPA AIDA program.

Python 9 3 Updated Dec 8, 2022

rxtan2 / video-grounding-narrations

Python 12 Updated Mar 12, 2023

antoine77340 / S3D_HowTo100M

S3D Text-Video model trained on HowTo100M using MIL-NCE

Python 196 21 Updated Jul 3, 2020

roudimit / AVLnet

Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.

Python 52 6 Updated Mar 30, 2022

YuqingWang1029 / VisTR

[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers

Python 752 98 Updated Jul 15, 2021

thias15 / TrackingNet

Python 2 Updated Aug 9, 2018

ajabri / videowalk

Repository for "Space-Time Correspondence as a Contrastive Random Walk" (NeurIPS 2020)

Python 272 38 Updated Dec 11, 2021

senthilps8 / demystifyssl

Python 7 Updated Sep 4, 2021

PeihaoChen / RSPNet

Official Pytorch implementation for AAAI2021 paper (RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning)

Python 37 8 Updated Nov 5, 2021

salesforce / CAST

Python 19 3 Updated May 1, 2025

alirezazareian / ovr-cnn

A new framework for open-vocabulary object detection, based on maskrcnn-benchmark

Python 241 28 Updated Feb 11, 2023

facebookresearch / selavi

This repo covers the implementation for Labelling unlabelled videos from scratch with multi-modal self-supervision, which learns clusters from multi-modal data in a self-supervised way.

Python 116 15 Updated Apr 26, 2021

hildekuehne / Mining_YouTube_dataset

Python 4 2 Updated Mar 9, 2020

wvangansbeke / Unsupervised-Classification

SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]

Python 1,436 271 Updated Jul 27, 2023

jadore801120 / attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 9,271 2,029 Updated Apr 16, 2024

jiasenlu / NeuralBabyTalk

PyTorch code for our CVPR 2018 paper "Neural Baby Talk"

Python 525 123 Updated Mar 27, 2019

abisee / pointer-generator

Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"

Python 2,191 809 Updated Jun 16, 2022

zjuchenlong / sca-cnn.cvpr17

Image Captions Generation with Spatial and Channel-wise Attention

Python 209 73 Updated Apr 4, 2018

imatge-upc / salgan

SalGAN: Visual Saliency Prediction with Generative Adversarial Networks

Python 375 106 Updated Dec 7, 2022

tanya34fish / food_for_fun

Information Retrieval (IR, 2015) Final Project

Python 2 1 Updated Dec 26, 2015
