首页 24小时热榜最新最佳问答展示工作

返回个人资料

veryluckyxyz 的提交内容

科技回声

基于 Next.js 构建的科技新闻平台，提供全球科技新闻和讨论内容。

首页

首页最新最佳问答展示工作

资源链接

HackerNews API 原版 HackerNews Next.js

© 2025 科技回声. 版权所有。

1

A Case Study in CUDA Kernel Fusion

1 点作者 veryluckyxyz12 个月前

2

Lessons from the trenches on reproducible evaluation of language models

42 点作者 veryluckyxyz12 个月前

3

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

2 点作者 veryluckyxyz大约 1 年前

4

Zero-Shot Tokenizer Transfer

2 点作者 veryluckyxyz大约 1 年前

5

An Empirical Model of Large-Batch Training

2 点作者 veryluckyxyz大约 1 年前

6

Gradient Diversity: A Key Ingredient for Scalable Distributed Learning

3 点作者 veryluckyxyz大约 1 年前

7

Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models

1 点作者 veryluckyxyz大约 1 年前

8

Automatically Detecting Under-Trained Tokens in Large Language Models

182 点作者 veryluckyxyz大约 1 年前

9

Large Language Models for Data Annotation: A Survey

2 点作者 veryluckyxyz大约 1 年前

10

Refusal in LLMs is mediated by a single direction

110 点作者 veryluckyxyz大约 1 年前

11

Automated Multi Agent Chat

2 点作者 veryluckyxyz大约 1 年前

12

Orca: A Distributed Serving System for Transformer-Based Generative Models

3 点作者 veryluckyxyz大约 1 年前

13

Understanding Emergent Abilities of Language Models from the Loss Perspective

2 点作者 veryluckyxyz大约 1 年前

14

LoRA+: Efficient Low Rank Adaptation of Large Models

181 点作者 veryluckyxyz大约 1 年前

15

Does Transformer Interpretability Transfer to RNNs?

3 点作者 veryluckyxyz大约 1 年前

16

MiniCPM: Potential of Small Language Models W Scalable Training Strategies

2 点作者 veryluckyxyz大约 1 年前

17

Building BerkeleyDB

2 点作者 veryluckyxyz大约 1 年前

18

Rotational Equilibrium: How Weight Decay Balances Learning Across NeuralNetworks

2 点作者 veryluckyxyz大约 1 年前

19

Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference

3 点作者 veryluckyxyz大约 1 年前

20

Bad arguments against a universal basic income

6 点作者 veryluckyxyz将近 9 年前

← 上一页下一页 →