-
G42
- Abu Dhabi
-
10:24
(UTC +04:00) - @alielfilali01
- in/alielfilali01
- https://huggingface.co/alielfilali01
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Anthropic's educational courses
[NeurIPS 2021] "Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models" by Boxin Wang*, Chejian Xu*, Shuohang Wang, Zhe Gan, Yu Cheng, Jianfeng Gao, Ahmed Hassan Awad…
A framework for pitting LLMs against each other in an evolving library of games ⚔
huggingface / yourbench
Forked from sumukshashidhar/yourbench🤗 Benchmark Large Language Models Reliably On Your Data
A Benchmark and Evaluation framework for evaluating Arabic LLM safeguards
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
🤗 smolagents: a barebones library for agents that think in code.
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
Finetuning Large Language Models on One Consumer GPU in 2 Bits
Minimalistic 4D-parallelism distributed training framework for education purpose
Large Concept Models: Language modeling in a sentence representation space
Code release for Best-of-N Jailbreaking
Reinforcement Learning Tutorials & other bedtime stories in PyTorch
Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders
أسئلة باللغة العربية تركز على الثقافة السعودية تم اختبارها على عدد من النماذج اللغوية الضخمة LLMs
A course on aligning smol models.
WildEval / ZeroEval
Forked from allenai/WildBenchA simple unified framework for evaluating LLMs
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
The Paper List on Data Contamination for Large Language Models Evaluation.
A reading list on LLM based Synthetic Data Generation 🔥