brian7685 / Starred · GitHub

🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.

Python 332 1 Updated Jul 6, 2025

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,349 172 Updated Mar 28, 2025

[CVPR 2024 Highlight] Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer

Python 394 35 Updated Dec 16, 2024

FreeStyle : Free Lunch for Text-guided Style Transfer using Diffusion Models

Python 123 10 Updated May 21, 2024

Code for “Pretrained Language Models as Visual Planners for Human Assistance”

Python 61 2 Updated Jun 12, 2023

Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"

Python 133 16 Updated Aug 21, 2024

The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)

Python 76 9 Updated Apr 24, 2023

Object Detection component developed for the DARPA AIDA program.

Python 9 3 Updated Dec 8, 2022

S3D Text-Video model trained on HowTo100M using MIL-NCE

Python 196 21 Updated Jul 3, 2020

Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.

Python 52 6 Updated Mar 30, 2022

[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers

Python 752 98 Updated Jul 15, 2021
Python 2 Updated Aug 9, 2018

Repository for "Space-Time Correspondence as a Contrastive Random Walk" (NeurIPS 2020)

Python 272 38 Updated Dec 11, 2021
Python 7 Updated Sep 4, 2021

Official Pytorch implementation for AAAI2021 paper (RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning)

Python 37 8 Updated Nov 5, 2021
Python 19 3 Updated May 1, 2025

A new framework for open-vocabulary object detection, based on maskrcnn-benchmark

Python 241 28 Updated Feb 11, 2023

This repo covers the implementation for "Labelling unlabelled videos from scratch with multi-modal self-supervision", which learns clusters from multi-modal data in a self-supervised way.

Python 116 15 Updated Apr 26, 2021

SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]

Python 1,436 271 Updated Jul 27, 2023

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 9,271 2,029 Updated Apr 16, 2024

PyTorch code for our CVPR 2018 paper "Neural Baby Talk"

Python 525 123 Updated Mar 27, 2019

Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"

Python 2,191 809 Updated Jun 16, 2022

Image Captions Generation with Spatial and Channel-wise Attention

Python 209 73 Updated Apr 4, 2018

SalGAN: Visual Saliency Prediction with Generative Adversarial Networks

Python 375 106 Updated Dec 7, 2022

Information Retrieval (IR, 2015) Final Project

Python 2 1 Updated Dec 26, 2015