Researchers at Meta have developed an artificial intelligence system called Video Joint Embedding Predictive Architecture (V-JEPA) that learns about the physical world through videos and demonstrates a notion of surprise when presented with information that goes against the knowledge it has gleaned. The model, which does not make any assumptions about the physics of the world contained in the videos, can begin to make sense of how the world works.
According to Micha Heilbron, a cognitive scientist at the University of Amsterdam who studies how brains and artificial systems make sense of the world, the claims made by the researchers are "very plausible" and the results are "super interesting." Heilbron stated that the model's ability to learn about the physical world through observation is a significant achievement, as it allows the AI system to develop a notion of objects' permanence, similar to how children learn about the world.
The researchers used a test of infants' understanding of objects' permanence as a benchmark to evaluate the AI system's performance. In the test, a glass of water is placed on a desk and then hidden behind a wooden board. When the board is moved towards the glass, the AI system's response is similar to that of a 6-month-old infant, who is surprised when the board passes through the glass as if it were not there. By a year, almost all children have developed an intuitive notion of objects' permanence, and the V-JEPA model has achieved a similar level of understanding.
The development of V-JEPA has significant implications for the field of artificial intelligence, as it demonstrates the ability of AI systems to learn about the physical world through observation and develop a notion of objects' permanence. This achievement could lead to the development of more advanced AI systems that can interact with the physical world in a more intuitive and human-like way.
The researchers' approach to developing V-JEPA is notable for its lack of assumptions about the physics of the world contained in the videos. Instead, the model learns about the world through observation and develops a notion of objects' permanence through trial and error. This approach is similar to how children learn about the world, and it has significant implications for the development of AI systems that can interact with the physical world.
As the field of artificial intelligence continues to evolve, the development of V-JEPA is an important milestone. The model's ability to learn about the physical world through observation and develop a notion of objects' permanence is a significant achievement, and it has the potential to lead to the development of more advanced AI systems that can interact with the physical world in a more intuitive and human-like way.
Share & Engage Share
Share this article