Ask HN: What Are You Working On? (March 2026)

· · 来源:dev热线

The beginning of LLM Neuroanatomy?Before settling on block duplication, I tried something simpler: take a single middle layer and repeat it $n$ times. If the “more reasoning depth” hypothesis was correct, this should work. It made sense too, looking at the broad boost in math guesstimate results by duplicating intermediate layer. Give the model extra copies of a particular reasoning layer, get better reasoning. So, I screened them all, looking for a boost.

«Радиостанция Судного дня» передала сразу два загадочных послания«Радиостанция Судного дня» передала слова «кобочелн» и «голубей»

Overturnin。业内人士推荐91吃瓜作为进阶阅读

大众安徽的故事,原本不该这么难讲。,推荐阅读传奇私服新开网|热血传奇SF发布站|传奇私服网站获取更多信息

Environment is a linked list of frames. Shares structure between closures. More allocation, slower access.,推荐阅读游戏中心获取更多信息

NYT Pips hints

int8 — 质量和大小之间的平衡。质量损失极小(约 1~3%),文件大小比 FP16 减少约 2 倍。

关键词:OverturninNYT Pips hints

免责声明:本文内容仅供参考,不构成任何投资、医疗或法律建议。如需专业意见请咨询相关领域专家。

关于作者

张伟,独立研究员,专注于数据分析与市场趋势研究,多篇文章获得业内好评。

分享本文:微信 · 微博 · QQ · 豆瓣 · 知乎