MITB Banner

Yann LeCun Cherry-picks Reinforcement Learning 

LeCun clearly is at odds with reinforcement learning and believes that for AI with common sense, it is not the way forward

Share

Listen to this story

The self-supervised learning guru and chief AI scientist at Meta AI, Yann LeCun, introduced the ‘cake analogy’, at NIPS 2016. “If intelligence is a cake, the bulk of the cake is unsupervised learning, the icing on the cake is supervised learning, and the cherry on the cake is reinforcement learning.”

However, while delivering a talk about SSL at NeurIPS 2022 in the context of achieving AGI, LeCun suggested abandoning the four most popular things at the momentum machine learning, including generative models, probabilistic models, contrastive methods, and reinforcement learning, Mofijul Islam pointed out on Twitter. 

To this, LeCun replied saying that he is not entirely unfamiliar with making statements that go against the common wisdom of the day. 

Meanwhile, several Twitter users and AI experts also joined in to express their views about it. Kyle Cranmer, physicist and professor at NYU, said that though he is a fan of generative models and probabilistics methods, he agrees with LeCun about the need of world models for reinforcement learning. Other people too proposed the idea of having a model for reinforcement learning and that a lot of research is now focused on just creating generative models. 

Cranmer added that LeCun makes statements that he thinks do not need explanation and though he agrees with him largely, he is willing to discuss the things that he is at odds with. LeCun agrees that just by looking at one slide from the presentation, people are making assumptions and drawing conclusions.

November shines bright for generative AI 

November 2022 was a great month for AI. Apart from NeurIPS, OpenAI released ChatGPT, an excellent chatbot that is touted to be a ‘Google Killer’. Stability AI announced the release of Stable Diffusion 2.0. Mind-vis, a mental image reading algorithm was released. Meta AI also released two new models – CICERO; the first model to achieve human level competence in ‘Diplomacy, and Galactica; trained on 120 billion parameters, focusing on scientific papers to aid academic research.

According to the research paper of Galactica, the model outperformed GPT-3 in technical knowledge probes of LaTeX equations. Many researchers and AI enthusiasts were excited to try it out for its generative capabilities. But it did not take long for the community to figure out that a lot of predictions and outcomes that were produced were inaccurate and hallucinating. This eventually led to Meta AI pulling the plug on the demo.

CICERO, on the other hand, the paper clearly states, integrates a language model using reinforcement learning algorithms and using human intervention and conversations to generate diplomacy. Maybe from the fall of this model, LeCun inferred that generative and predictive models might not achieve the goal that he had in mind – AGI. 

Read: Meet the Meta AI Researcher Who Helped Build CICERO

OpenAI’s ChatGPT has been gaining significant popularity since its release last week with people experimenting with it and touting it to be a glimpse of GPT-4. This clearly highlights the importance of generative models. The rise of image generation models like Stable Diffusion or DALL-E also add points to the same.

Russ Salakhutdinov, UPMC professor of CS at Carnegie Mellon University and former director of AI research at Apple, praised the Galactica paper but also pointed out that LeCun was earlier promoting the generative and probabilistic methods and capabilities of Galactica, but now after its downfall, says that these methods should be replaced with joint-embedding architectures.

To LeCun’s credit, he replied saying that during the talk at the conference he explained that people interested in applications of generative and predictive models can clearly use them, but his recommendations are for researchers pursuing the path towards imparting common sense and reasoning capabilities in AI.  He recommends VICreg, a Meta AI developed algorithm for variance-invariance-covariance regularisation for SSL, to replace contrastive methods.

Yann LeCun has been on the headline several times, not just for amazing innovations, but also for various controversies. In May, the very popular debate about ‘AI hitting the wall’ surfaced when Gary Marcus tweeted a video of Tesla hitting an aeroplane. Similarly in July, LeCun’s paper – A Path Towards Autonomous Machine Intelligence — was facing controversy when Jurgen Schmidhuber claimed that LeCun’s ‘original contributions’ actually felt to him like deja vu of his work.

Share
Picture of Mohit Pandey

Mohit Pandey

Mohit dives deep into the AI world to bring out information in simple, explainable, and sometimes funny words. He also holds a keen interest in photography, filmmaking, and the gaming industry.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Subscribe to Our Newsletter

The Belamy, our weekly Newsletter is a rage. Just enter your email below.