Created
July 19, 2025 06:40
-
-
Save skylarbpayne/e9a9cc11e5d4c2c010ccb851ceef30e6 to your computer and use it in GitHub Desktop.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# Recent Research Papers on Machine Learning | |
## Summary of Work | |
1. **Hierarchical Rectified Flow Matching with Mini-Batch Couplings**: Introduces a hierarchical flow matching model to better capture multi-modality in velocity fields for generative modeling, with benefits shown in synthetic and imaging data. | |
2. **VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning**: Proposes VisionThink, a dynamic approach to visual token compression in vision-language models using reinforcement learning to optimize token use based on task complexity. | |
3. **Latent Policy Steering with Embodiment-Agnostic Pretrained World Models**: Focuses on reducing data collection for visuomotor policies by leveraging multi-embodiment datasets using optic flow and World Models, improving policy performance. | |
4. **Training Transformers with Enforced Lipschitz Constants**: Develops tools for maintaining norm-constrained weights in transformers to enhance stability and robustness, exploring performance tradeoffs. | |
5. **The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner**: Proposes TAIL to improve length generalization in language models via Turing Machine imitation learning, showing significant performance improvements on reasoning tasks. | |
## Papers | |
1. [Hierarchical Rectified Flow Matching with Mini-Batch Couplings](http://arxiv.org/pdf/2507.13350v1) - Yichi Zhang et al. | |
2. [VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning](http://arxiv.org/pdf/2507.13348v1) - Senqiao Yang et al. | |
3. [Latent Policy Steering with Embodiment-Agnostic Pretrained World Models](http://arxiv.org/pdf/2507.13340v1) - Yiqi Wang et al. | |
4. [Training Transformers with Enforced Lipschitz Constants](http://arxiv.org/pdf/2507.13338v1) - Laker Newhouse et al. | |
5. [The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner](http://arxiv.org/pdf/2507.13332v1) - Zhouqi Hua et al. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment