- Oregon State University
- https://huazhengwang.github.io/
Stars
Risk-Aware Preference-based Reinforcement Learning (RA-PbRL)
Embodied and organized multi-LLM-agent teams supporting communication for >3 agents. Source codes for the paper "Embodied LLM Agents Learn to Cooperate in Organized Teams".
LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking
Sample code and application showcases to get you going with AG2 (formerly AutoGen)
AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/pAbnFJrkgZ
AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks
This codebase is based on the OLTR codebase.
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Unbiased Learning To Rank Algorithms (ULTRA)
Code for the experiments of Matrix Factorization Bandit
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5
Balancing Speed and Quality in Online Learning to Rank for Information Retrieval
axthorpe / BanditLib
Forked from HCDM/BanditLib
Library of contextual bandits algorithms
Distributed skipgram mixture model for multisense word embedding