Machine Learns Substack
Machine Learns #51
OpenAI locks down labs after DeepSeek allegations, CEO exits X & Anthropic reaches $4B, 11 new models include Kyutai TTS & FLUX.1 Kontext, plus research…
Jul 9
•
Eren Gölge
June 2025
Machine Learns #50
Anthropic's copyright victory, OpenAI & Google in court, model releases POLARIS & MiniMax-M1, ZipVoice TTS & LeVo song generation, plus new research in…
Jun 25
•
Eren Gölge
Machine Learns #49
OpenAI's Google partnership & social network, BYD's 1000kW EV charging, new models like Gemma 3n & Magistral-Small, plus breakthrough research in long…
Jun 11
•
Eren Gölge
May 2025
Machine Learns #48
OpenAI's 'Sign in with ChatGPT', Meta's AGI ambitions, new models like Gemma 3 & MAGI-1, research breakthroughs in KV caching for diffusion & PaTH…
May 28
•
Eren Gölge
Paper check: Merging LLMs at Pre-training, Considering Token Probabilities at RL
🔬 Two papers in scope: "Model Merging in Pre-training for LLMs" and "Do Not Let Low-Probability Tokens Over-Dominate in RL"
May 21
•
Eren Gölge
Machine Learns #47
No fluff bi-weekly AI newsletter: OpenAI/Microsoft renegotiations, AI leaderboards are broken, new models from Mistral & Microsoft, boosting recurrent…
May 14
•
Eren Gölge
Paper check: Softpick, Canon Layers, Parallel Transformer
Going over recent Transformer papers (Softpick, Canon Layers, Parallel Transformer) and my takes after benchmarking with BlaGPT
May 6
•
Eren Gölge
April 2025
Machine Learns #46
Mira Murati poaching researchers at ICLR, OpenAI launches gpt-image-1 API, DeepMind UK staff unionizing, Qwen3 & F Lite model releases, ICLR conference…
Apr 30
•
Eren Gölge
Machine Learns #45
OpenAI's social network & GPT-4.1, China launches $8.2B AI fund, NVIDIA's US manufacturing push, new GLM-4 & MineWorld models, C3PO expert pathways…
Apr 16
•
Eren Gölge
Machine Learns #44
Praxis Sam Altman's tech utopia, Amazon launches Nova Sonic voice AI, Midjourney returns with V7, Llama 4 models debut amid controversy, new…
Apr 9
•
Eren Gölge
March 2025
Machine Learns #43
Gemini's real-time features launch, major tech acquisitions including Google's $32B Wiz deal, new DeepSeek model is out and 8 more new open models, plus…
Mar 26
•
Eren Gölge
Machine Learns #42
China's new AI Agent Manus, Microsoft and OpenAI slowly separating, Anthropic raised $3.5 billion, Meta tests new AI chips, Google released Gemma-3…
Mar 12
•
Eren Gölge