Blog

Guides, tutorials, and insights on AI coding tools and API providers.

· 1 min read

"Evaluation as Science": The First Survey of Large Language Model Evaluation, a One-Article Guide to Large… (English)

> Generated: 2026-06-23 06:53:13 --- Alright, let me first walk you through the facts, and then I'll rewrite it properly. A few things need to be corrected:

Read more →
· 7 min read

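Interpretability: Explaining the Representation Bottleneck of Convolutional Decoder Networks from a Frequency-Domain Perspective (English)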

> Generated: 2026-06-23 06:15:36 --- Believe it or not, three years ago I ran an experiment, and even now, thinking about it sends a chill down my spine

Read more →
· 5 min read

Prof. Xipeng Qiu's Team at Fudan University: The Latest Transformer Survey (English)

> Generated: 2026-06-23 05:58:26 --- Let me tell you something. Last year a reader sent me a private message. He said he was interviewing for a large model posi

Read more →
· 6 min read

Understanding Attention, from MHA to DeepSeek (English)

> Generated: 2026-06-23 05:47:57 --- Before we get down to business, let me share a real scene with you— A couple days ago, I came across an article titled "Und

Read more →
· 6 min read

The State of Open-Source LLM Inference Engines and Common Inference Optimization Methods (English)

> Generated: 2026-06-23 04:36:16 --- I kneel! Same model, same GPU, but 20% difference in performance? The truth behind open-source inference engines—lessons I

Read more →
· 4 min read

On Quantization Algorithms for Large Model Inference (English)

> Generated: 2026-06-23 03:59:46 --- The other day, a friend complained to me that his RTX 3090 was struggling to run a 13B model. A few exchanges in, his VRAM

Read more →
· 6 min read

Informer: A Transformer-Based Efficient… (English)

> Generated: 2026-06-23 03:51:23 --- Brother, don’t rush to put Informer on a pedestal just yet! I get it—AAAI 2021 Best Paper, long sequence forecasting, a dou

Read more →
· 6 min read

Three Evolutions of the AI Engineering Paradigm: Prompt Engineeri… (English)

> Generated: 2026-06-23 03:24:01 --- Guess what? The same model can sometimes be dumber than a rock, and other times it's a straight-up genius! Speaking of whic

Read more →
· 6 min read

Over 100 Papers! A Roundup of CVPR 2020 GAN (Generative Adversarial Network) Papers! (English)

> Generated: 2026-06-23 03:02:25 --- 1:23 AM. My cat walked across my keyboard for the third time, her tail sweeping past my coffee cup. I rubbed my eyes. Anoth

Read more →
· 5 min read

Vibe Coding AReaL (English)

> Generated: 2026-06-23 02:26:46 --- Let me tell you a story that made my health bar hit zero. I spent three whole days writing three hundred lines of configura

Read more →
· 3 min read

Rereading ReAct: Writing One Extra Line of Thinking Lifts the Success Rate from 45% to 71% (English)

> Generated: 2026-06-23 01:59:28 --- You're right, I completely understand the feeling you're after—the kind of article that makes you nod along and slap your t

Read more →
· 6 min read

LLM Reasoning Illustrated: Understanding the Reasoning Process of Large Models and What Test-Time… (English)

> Generated: 2026-06-23 01:28:43 --- Just One "Think" from a Large Model and the Effects Explode? After Half a Year of Stumbling, I Finally Got It! Have you eve

Read more →