Python 352 文档
|
4047
|
2019/04/18 19:46 |
Django 1.11.6 中文
|
1221
|
2018/11/27 14:49 |
docker 官方文档
|
1013
|
2018/03/18 17:41 |
Django 182 中文
|
885
|
2018/03/18 17:38 |
tensorflow 1.3 中文文档
|
869
|
2018/03/25 17:47 |
theano 0.9 中文文档
|
544
|
2018/03/18 17:41 |
NumPy v1.11 中文
|
465
|
2018/05/08 10:20 |
scipy-lecture-notes
|
434
|
2018/03/18 17:41 |
Deep Learning Tutorials
|
257
|
2018/03/18 17:42 |
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
|
151
|
2024/03/31 20:35 |
UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition
|
104
|
2024/05/11 15:02 |
NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data
|
66
|
2024/05/09 16:07 |
NLTK with Python 3
|
66
|
2018/03/18 17:38 |
StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning
|
62
|
2024/06/20 18:57 |
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
|
41
|
2024/05/07 18:03 |
OLMo: Accelerating the Science of Language Models
|
36
|
2024/03/14 20:06 |
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
|
32
|
2024/03/21 18:59 |
Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study
|
31
|
2024/03/15 13:26 |
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
|
29
|
2024/03/11 16:48 |
A Survey on Recent Advances in LLM-Based Multi-turn Dialogue Systems
|
28
|
2024/03/30 15:51 |
Speech-based Slot Filling using Large Language Models
|
27
|
2024/05/09 11:33 |
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units
|
26
|
2024/06/20 19:01 |
InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction
|
25
|
2024/05/11 11:19 |
Conformer: Convolution-augmented Transformer for Speech Recognition
|
23
|
2024/07/04 17:01 |
Beyond Language Models: Byte Models are Digital World Simulators
|
23
|
2024/03/14 08:41 |
Python 2.7.8 中文文档
|
23
|
2018/03/18 17:38 |
RFC 8725: JSON Web Token Best Current Practices
|
19
|
2024/05/15 23:03 |
OCR-free Document Understanding Transformer
|
19
|
2024/04/02 22:16 |
RAFT: Adapting Language Model to Domain Specific RAG
|
19
|
2024/03/18 14:37 |
On the Use of BERT for Automated Essay Scoring: Joint Learning of Multi-Scale Essay Representation
|
17
|
2024/05/28 19:35 |
WeNet: Production oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit
|
15
|
2024/07/04 16:20 |
The Data Lakehouse: Data Warehousing and More
|
15
|
2024/03/14 22:19 |
A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models
|
14
|
2024/04/25 19:10 |
SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents
|
13
|
2024/05/15 09:32 |
PaLM 2 Technical Report
|
10
|
2024/06/19 18:09 |
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
|
10
|
2024/05/31 14:25 |
Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
|
10
|
2024/04/25 17:32 |
Probing the Decision Boundaries of In-context Learning in Large Language Models
|
8
|
2024/06/25 12:06 |
RFC 9091: Experimental Domain-Based Message Authentication, Reporting, and Conformance (DMARC) Extension for Public Suffix Domains
|
8
|
2024/05/31 18:57 |
RFC 8765: DNS Push Notifications
|
8
|
2024/05/17 15:22 |
RFC 8744: Issues and Requirements for Server Name Identification (SNI) Encryption in TLS
|
8
|
2024/05/16 18:44 |
SRB measures for mostly expanding partially hyperbolic diffeomorphisms via the variational approach
|
8
|
2024/03/13 22:46 |
Attention Is All You Need 中文翻译
|
8
|
2018/04/26 20:04 |
DeepMind Control Suite
|
7
|
2024/06/25 17:02 |
Schema-Guided Dialogue State Tracking Task at DSTC8
|
7
|
2024/05/22 13:36 |
Attending to Graph Transformers
|
7
|
2024/04/07 16:23 |
Maybe Only 0.5% Data is Needed: A Preliminary Exploration of Low Training Data Instruction Tuning
|
7
|
2024/03/21 16:51 |
Are LLMs All You Need for Task-Oriented Dialogue?
|
6
|
2024/04/25 14:24 |
Shortcuts to adiabaticity: concepts, methods, and applications
|
6
|
2024/03/14 22:28 |
LoRA+: Efficient Low Rank Adaptation of Large Models
|
6
|
2024/03/14 10:16 |
Are prime numbers and quadratic residues random?
|
6
|
2024/03/13 22:39 |
Revisit Input Perturbation Problems for LLMs: A Unified Robustness Evaluation Framework for Noisy Slot Filling Task
|
5
|
2024/06/11 17:55 |
RFC 8772: The China Mobile, Huawei, and ZTE Broadband Network Gateway (BNG) Simple Control and User Plane Separation Protocol (S-CUSP)
|
5
|
2024/05/17 21:56 |
Alexa Conversations: An Extensible Data-driven Approach for Building Task-oriented Dialogue Systems
|
5
|
2024/04/25 17:43 |
Do language models plan ahead for future tokens?
|
5
|
2024/04/06 09:03 |
Speech Robust Bench: A Robustness Benchmark For Speech Recognition
|
5
|
2024/03/14 22:21 |
BookSQL: A Large Scale Text-to-SQL Dataset for Accounting Domain
|
4
|
2024/06/14 10:58 |
Hybrid Autoregressive Transducer (hat)
|
4
|
2024/05/23 14:12 |
HR-MultiWOZ: A Task Oriented Dialogue (TOD) Dataset for HR LLM Agent
|
4
|
2024/05/20 16:29 |
sphinx 1.6
|
4
|
2018/03/18 17:41 |
Towards Realistic Few-Shot Relation Extraction: A New Meta Dataset and Evaluation
|
3
|
2024/06/11 16:54 |
Task-Oriented Dialogue with In-Context Learning
|
3
|
2024/05/28 10:50 |
LoRA Learns Less and Forgets Less
|
3
|
2024/05/17 22:01 |
BootTOD: Bootstrap Task-oriented Dialogue Representations by Aligning Diverse Responses
|
3
|
2024/04/25 17:57 |
Sequence to Sequence Learning with Neural Networks
|
3
|
2024/04/02 19:19 |
Towards Real Smart Apps: Investigating Human-AI Interactions in Smartphone On-Device AI Apps
|
3
|
2024/03/30 09:10 |
Identity Matters in Deep Learning
|
3
|
2024/03/11 21:56 |
test1
|
3
|
2018/03/18 17:42 |
Controllable Time-Delay Transformer for Real-Time Punctuation Prediction and Disfluency Detection
|
2
|
2024/07/19 13:25 |
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages
|
2
|
2024/07/18 17:58 |
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
|
2
|
2024/06/26 09:39 |
CVSS Corpus and Massively Multilingual Speech-to-Speech Translation
|
2
|
2024/06/24 11:35 |
The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation
|
2
|
2024/06/19 11:29 |
Prompt Design and Engineering: Introduction and Advanced Methods
|
2
|
2024/06/07 10:36 |
RFC 8990: GeneRic Autonomic Signaling Protocol (GRASP)
|
2
|
2024/05/27 13:41 |
Autonomous Evaluation and Refinement of Digital Agents
|
2
|
2024/04/14 19:04 |
Wandering Within a World: Online Contextualized Few-Shot Learning
|
2
|
2024/03/30 09:53 |
How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?
|
2
|
2024/03/29 23:02 |
SimCSE: Simple Contrastive Learning of Sentence Embeddings
|
2
|
2024/03/23 21:40 |
Computational Scatter Correction for High-Resolution Flat-Panel CT Based on a Fast Monte Carlo Photon Transport Model
|
2
|
2024/03/14 22:03 |
Jinja2 2.9
|
2
|
2018/03/18 17:42 |
Efficient Monotonic Multihead Attention
|
1
|
2024/07/17 17:23 |
Make Your LLM Fully Utilize the Context
|
1
|
2024/06/18 17:10 |
Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning
|
1
|
2024/06/18 17:03 |
CRAG -- Comprehensive RAG Benchmark
|
1
|
2024/06/10 22:35 |
Can Better Text Semantics in Prompt Tuning Improve VLM Generalization?
|
1
|
2024/05/25 10:40 |
ZigMa: A DiT-style Zigzag Mamba Diffusion Model
|
1
|
2024/05/25 10:38 |
Predicting Emergent Abilities with Infinite Resolution Evaluation
|
1
|
2024/05/23 23:18 |
Retrieval-Augmented Generation with Knowledge Graphs for Customer Service Question Answering
|
1
|
2024/05/23 22:30 |
TA&AT: Enhancing Task-Oriented Dialog with Turn-Level Auxiliary Tasks and Action-Tree Based Scheduled Sampling
|
1
|
2024/04/25 17:49 |
GPT-4 Technical Report
|
1
|
2024/04/21 20:22 |
NetTrack: Tracking Highly Dynamic Objects with a Net
|
1
|
2024/04/21 14:50 |
Jamba: A Hybrid Transformer-Mamba Language Model
|
1
|
2024/04/01 22:15 |
Learning Actor Relation Graphs for Group Activity Recognition
|
1
|
2024/03/23 15:17 |
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
|
1
|
2024/03/16 14:31 |
ReCode: Robustness Evaluation of Code Generation Models
|
1
|
2024/03/16 09:54 |
An Embarrassingly Easy but Strong Baseline for Nested Named Entity Recognition
|
1
|
2024/03/14 22:23 |
Graph Data Condensation via Self-expressive Graph Structure Reconstruction
|
1
|
2024/03/13 22:35 |
Graph Neural Networks for Scalable Radio Resource Management: Architecture Design and Theoretical Analysis
|
1
|
2024/03/11 22:49 |
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
|
1
|
2024/03/03 22:17 |
XLNet: Generalized Autoregressive Pretraining for Language Understanding
|
1
|
2024/02/29 23:20 |