TechEcho
Submissions by veryluckyxyz
1
Understanding Perception and Reasoning Through Model Merging
2 points
by
veryluckyxyz
4 days ago
no comments
2
Building and better understanding vision-language models (2024)
2 points
by
veryluckyxyz
9 days ago
no comments
3
HF smolagents computer-agent demo
1 point
by
veryluckyxyz
12 days ago
no comments
4
Do Reasoning Models Show Better Verbalized Calibration?
2 points
by
veryluckyxyz
29 days ago
no comments
5
Robustly identifying concepts introduced during chat fine-tuning with crosscoder
6 points
by
veryluckyxyz
about 1 month ago
no comments
6
Retrieval with Learned Similarities
3 points
by
veryluckyxyz
about 2 months ago
no comments
7
The Curse of Depth in Large Language Models
1 point
by
veryluckyxyz
about 2 months ago
no comments
8
Looking Back at Speculative Decoding
36 points
by
veryluckyxyz
3 months ago
5 comments
9
Long-Context GRPO
60 points
by
veryluckyxyz
3 months ago
22 comments
10
HippoRAG: Neurobiologically Inspired Long-Term Memory for LLMs (2024)
65 points
by
veryluckyxyz
3 months ago
4 comments
11
Learning to Plan and Reason for Evaluation with Thinking-LLM-as-a-Judge
1 point
by
veryluckyxyz
4 months ago
no comments
12
Process Reinforcement Through Implicit Rewards
1 point
by
veryluckyxyz
5 months ago
no comments
13
Explaining Large Language Models Decisions Using Shapley Values
89 points
by
veryluckyxyz
5 months ago
19 comments
14
Phi-4 Technical Report
2 points
by
veryluckyxyz
5 months ago
no comments
15
Alignment Faking in LLMs [pdf]
2 points
by
veryluckyxyz
5 months ago
1 comment
16
What Makes Rotary Positional Encodings Useful?
1 point
by
veryluckyxyz
6 months ago
no comments
17
Rethinking Softmax: Self-Attention with Polynomial Activations
2 points
by
veryluckyxyz
7 months ago
no comments
18
Post-Training Layer Scaling Prevents Forgetting and Enhances Model Merging
1 point
by
veryluckyxyz
7 months ago
no comments
19
Random Matrix Theory in Machine Learning Tutorial
2 points
by
veryluckyxyz
8 months ago
no comments
20
Rerankers: A Lightweight Python Library to Unify Ranking Methods
1 point
by
veryluckyxyz
8 months ago
no comments
21
Double Descent Demystified
1 point
by
veryluckyxyz
8 months ago
no comments
22
Synthetic Continued Pretraining
3 points
by
veryluckyxyz
8 months ago
no comments
23
Bright: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
1 point
by
veryluckyxyz
10 months ago
no comments
24
Artificial needles to real haystacks: Improving retrieval capabilities in LLMs
101 points
by
veryluckyxyz
11 months ago
21 comments
25
From Decoding to Meta-Generation: (LLMs)
2 points
by
veryluckyxyz
11 months ago
no comments
26
Warp: On the Benefits of Weight Averaged Rewarded Policies
2 points
by
veryluckyxyz
11 months ago
no comments
27
Experiments in Weak-to-Strong Generalization
1 point
by
veryluckyxyz
11 months ago
no comments
28
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
3 points
by
veryluckyxyz
12 months ago
no comments
29
A Case Study in CUDA Kernel Fusion
1 point
by
veryluckyxyz
12 months ago
no comments