Distilling the Knowledge in a Neural Network 论文翻译
Layer Normalization 论文翻译
Convolutional Neural Networks for Sentence Classification 论文翻译
XLNet 论文翻译
UNSUPERVISED DATA AUGMENTATION FOR CONSISTENCY TRAINING
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks 论文中文翻译
Gaussian Error Linear Units (GELUs) 论文中文翻译
BERT 中文翻译