Stars
TradingAgents: Multi-Agents LLM Financial Trading Framework
Keep searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)
LangGPT: Empowering everyone to become a prompt expert! 🚀 Structured Prompt, Language of GPT, structured prompting. Created by 「云中江树」
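拼好RAG: A hand-built integration of GraphRAG, LightRAG, and Neo4j-llm-graph-builder for knowledge graph construction and search; incorporates DeepSearch techniques for reasoning over private-domain RAG; includes a custom evaluation framework for GraphRAG | Integrate GraphRAG, LightRAG, and Neo4j-llm-graph-builder for knowledge graph construct…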
11 Lessons to Get Started Building AI Agents
Production-ready platform for agentic workflow development.
Train a Language Model with GRPO to create a schedule from a list of events and priorities
Official repository of paper titled "UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities".
A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, along with new ideas, new problems, new resources, and new projects from work and research
[TCSVT24] The implementation of paper "UMP: Unified Modality-aware Prompt Tuning for Text-Video Retrieval".
This repository contains the implementation of the method described in our paper, "Divide and Conquer: Isolating Normal-Abnormal Attributes in Knowledge Graph-Enhanced Radiology Report Generation".
Improving Chest X-Ray Report Generation by Leveraging Warm-Starting
Radiology Report Generation with Frozen LLMs
Official Code for Contrastive Learning with Counterfactual Explanations for Radiology Report Generation (ECCV 2024)
A Survey on CLIP in Medical Imaging
Official code of "Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training" [ICML 2024]
[MICCAI-2022] This is the official implementation of Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training.
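A collection of datasets for different types of medical imaging, each with an introduction and a download link. "1.docx" is the table of contents for what has already been uploaded to GitHub; the main reasons some datasets could not be uploaded are: 1. they are too large, 2. they require an application email.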
The official codes for "PMC-LLaMA: Towards Building Open-source Language Models for Medicine"
This repository is made for the paper: Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medical Visual Question Answering
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance
A latent text-to-image diffusion model
Stable Diffusion web UI