Posts tagged "mathematics"

3 posts found

February 16, 2026

MaxRL: From REINFORCE to Maximum Likelihood

Why dividing by the number of successes instead of the batch size changes what your gradient estimator optimizes — and how this connects REINFORCE, maximum likelihood, and pass@k through one clean mathematical identity.

reinforcement-learning, machine-learning, policy-gradient

February 15, 2026

Reinforcement Learning from Scratch

Building RL from the ground up — actions, rewards, policies, expected reward, the policy gradient theorem, and REINFORCE — all derived step by step with concrete examples.

reinforcement-learning, machine-learning, policy-gradient

February 14, 2026

Mathematical Prerequisites for Reinforcement Learning

Building the math foundations you need for RL — probability, expected value, derivatives, the log trick, and Monte Carlo estimation — all through one consistent example.

reinforcement-learning, mathematics, machine-learning