A General Theoretical Paradigm to Understand Learning from Human Preferences
|
59
|
2024/02/27 23:09 |
Large Language Models as Zero-shot Dialogue State Tracker through Function Calling
|
35
|
2024/02/21 16:11 |
ChatQA: Building GPT-4 Level Conversational QA Models
|
29
|
2024/02/22 12:00 |
A Closer Look at the Limitations of Instruction Tuning
|
20
|
2024/02/20 17:33 |
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels
|
15
|
2024/02/26 22:21 |
Zephyr: Direct Distillation of LM Alignment
|
13
|
2024/02/21 14:05 |
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
|
12
|
2024/04/11 09:36 |
Code Llama: Open Foundation Models for Code
|
6
|
2024/04/24 21:58 |
Datasets for Large Language Models: A Comprehensive Survey
|
6
|
2024/03/08 09:28 |
Conformer: Convolution-augmented Transformer for Speech Recognition
|
6
|
2024/03/02 11:07 |
DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
|
6
|
2024/02/26 22:34 |
Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive
|
6
|
2024/02/26 13:36 |
CaT: Balanced Continual Graph Learning with Graph Condensation
|
5
|
2024/04/10 23:01 |
Retrieval-Augmented Generation for AI-Generated Content: A Survey
|
5
|
2024/03/05 23:14 |
Concavity Properties of Solutions of Elliptic Equations under Conformal Deformations
|
4
|
2024/03/07 00:10 |
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
|
3
|
2024/03/03 22:39 |
dpo
|
3
|
2024/02/20 10:27 |
LM-Infinite: Zero-Shot Extreme Length Generalization for Large Language Models
|
2
|
2024/04/24 21:59 |
A flexible event reconstruction based on machine learning and likelihood principles
|
2
|
2024/03/04 22:49 |
Sequence to Sequence Learning with Neural Networks
|
2
|
2024/02/27 22:29 |
SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement
|
1
|
2024/04/10 23:17 |
Structured Pruning Learns Compact and Accurate Models
|
1
|
2024/04/10 17:39 |
RAFT: Adapting Language Model to Domain Specific RAG
|
1
|
2024/03/20 23:49 |
Jailbroken: How Does LLM Safety Training Fail?
|
1
|
2024/03/17 19:40 |
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
|
1
|
2024/02/03 10:04 |