Noul model codex are practic un context nelimitat, antrenat cu RL end-to-end