@willccbb
willccbb / grpo_demo.py
Last active August 16, 2025 05:06
GRPO Llama-1B
# train_grpo.py
#
# See https://github.com/willccbb/verifiers for ongoing developments
#
"""
citation:
@misc{brown2025grpodemo,
title={Granular Format Rewards for Eliciting Mathematical Reasoning Capabilities in Small Language Models},
author={Brown, William},
@sdhzdmzzl
sdhzdmzzl / bj-unicom-iptv.m3u
Last active June 30, 2025 09:17
Beijing Unicom IPTV channel list
#EXTINF:-1,CCTV-1
rtp://239.3.1.1:8000
#EXTINF:-1,CCTV-2
rtp://239.3.1.2:8004
#EXTINF:-1,CCTV-4
rtp://239.3.1.4:8216
#EXTINF:-1,CCTV-7
@aa65535
aa65535 / aria2.bat
Last active April 18, 2025 09:41
Aria2 configuration file & startup script
:: Keep this window open after launch; closing the window ends the process
aria2c --conf-path=aria2.conf -D