https://www.youtube.com/watch?v=fMJMhBFa_Gc
On the second day of its 12-day series of announcements, OpenAI introduced a preview of 'Reinforcement Fine-Tuning' using the reasoning model 'o1' and announced that it would be officially released within the next 12 months.
Open AI...