• Skip to primary navigation
  • Skip to content
  • Skip to footer
연구, 개발, 디버깅... 의미 있는 삽질 일지 연구, 개발, 디버깅... 의미 있는 삽질 일지 Actions make memories
  • Category
  • Tag
  • Search

    DimensionSTP

    Fun, creativity, and persistence

    • Seoul, Republic of Korea
    • Email
    • GitHub

    최근 포스트

    PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost Review

    2026-04-17 11 분 소요

    0. Introduction

    Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Review

    2026-04-16 16 분 소요

    0. Introduction

    Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Review

    2026-04-15 15 분 소요

    0. Introduction

    Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models Review

    2026-04-15 12 분 소요

    0. Introduction

    2 OLMo 2 Furious Review

    2026-04-13 10 분 소요

    0. Introduction

    • 이전
    • 1
    • 2
    • 3
    • …
    • 7
    • 다음
    • 팔로우:
    • GitHub
    • 피드
    © 2026 DimensionSTP. Powered by Jekyll & Minimal Mistakes.