一译 —— 文档和论文翻译、对照阅读、讨论和社区

Evolutionary Neural Architecture Search for Transformer in Knowledge Tracing

Knowledge tracing (KT) aims to trace students' knowledge states by predicting whether students answer correctly on exercises. Despite the excellent performance of existing Transformer-based KT approac ...

0 0 0 2025/07/05 arXiv:2310.01180v1 乐乐

OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning

The Open Whisper-style Speech Models (OWSM) project has developed a series of fully open speech foundation models using academic-scale resources, but their training data remains insufficient. This wor ...

0 0 0 2025/07/05 arXiv:2506.00338v1 luffy

SiPipe: Bridging the CPU-GPU Utilization Gap for Efficient Pipeline-Parallel LLM Inference

As inference workloads for large language models (LLMs) scale to meet growing user demand, pipeline parallelism (PP) has become a widely adopted strategy for multi-GPU deployment, particularly in cros ...

0 0 0 2025/07/05 arXiv:2506.22033v1 大写的P和大写的G

Task Placement and Resource Allocation for Edge Machine Learning: A GNN-based Multi-Agent Reinforcement Learning Paradigm

Machine learning (ML) tasks are one of the major workloads in today's edge computing networks. Existing edge-cloud schedulers allocate the requested amounts of resources to each task, falling short of ...

0 0 0 2025/07/05 arXiv:2302.00571v2 fangry

Reangle-A-Video: 4D Video Generation as Video-to-Video Translation

我们介绍了Reangle-A-Video，这是一个统一的框架，用于从单个输入视频中生成同步的多视频视频。与大规模4D数据集上训练多视频视频扩散模型的主流方法不同，我们的方法将多视频视频生成任务重新塑造为视频到视频翻译，利用公开可用的图像和视频扩散先验。从本质上讲，Reangle-A-Video分为两个阶段 ...

0 0 0 2025/07/05 arXiv:2503.09151v2 陆三七

RevGNN: Negative Sampling Enhanced Contrastive Graph Learning for Academic Reviewer Recommendation

获取审阅者进行学术提交是一个具有挑战性的建议方案。最近以图形学习为驱动的模型在推荐领域取得了显着进步，但是他们在学术审稿人推荐任务中的表现可能会遭受重大的假负问题。这是由于未观察到的边缘代表负样本的假设 ...

0 0 0 2025/07/05 arXiv:2407.20684v1 siweima

A Contrastive Framework with User, Item and Review Alignment for Recommendation

为用户和项目学习有效的潜在表示是推荐系统的基石。传统方法依靠用户项目的交互数据将用户和项目映射到共享的潜在空间中，但是交互的稀疏通常会带来挑战。尽管利用用户评论可以减轻这种稀疏性，但现有的评论意见推荐模型通常显示出两个关键局限性 ...

0 0 0 2025/07/05 arXiv:2501.11963v2 siweima

Triformer: Triangular, Variable-Specific Attentions for Long Sequence Multivariate Time Series Forecasting--Full Version

各种现实世界的应用程序都依靠未来的信息来做出决策，因此要求有效，准确的长序列多元时间序列序列预测。尽管最近基于注意力的预测模型在捕获长期依赖方面表现出强大的能力，但它们仍然受到两个关键局限性。首先，规范的自我注意力具有二次复杂性w ...

0 0 0 2025/07/05 arXiv:2204.13767v1 dyw

来一起翻译吧！

为了您和其他读者获得更好的阅读体验，请您勇敢地改进翻译，特别是一些显而易见的机器翻译错误。

虽然我们追求卓越，但我们并不要求翻译十全十美，因此请不要担心您翻译有误 —— 我们的服务器已经记录所有的翻译，您不必担心会因为您的失误导致无法挽回的破坏。（改编自维基百科）