Huifeng Chen josherich

This man is Untitled

47 followers · 51 following

New York City
https://www.josherich.me

View GitHub Profile

Recently created

Least recently created

Recently updated

Least recently updated

josherich / a.md

Created July 3, 2025 17:10

A Galileo moment for LLM design

A Galileo Moment for LLM Design

Introduction

This document discusses the significant advancements in Large Language Model (LLM) architecture design, drawing parallels to pivotal moments in the history of science, such as the Pisa Tower experiment that catalyzed modern physics. Our findings reveal the true limits of LLM architectures through a controlled synthetic pretraining environment, marking a potential turning point in LLM research that may delineate the field into “before” and “after.”

josherich / 1.md

Created July 23, 2025 20:58

Beyond Binary Rewards: Training LMs to Reason About Their Uncertainty

New Research Paper Announcement

Introduction

We are pleased to announce the release of our latest research paper, which explores the training of reasoning Large Language Models (LLMs) to effectively reason about their areas of uncertainty. This work is particularly relevant in high-stakes domains such as healthcare and law, where the reliability of LLMs is critical.

Key Findings

Reasoning Training and its Challenges

OlderNewer