Microsoft’s VASA-1 can deepfake a person with one photo and one audio track
On Tuesday, Microsoft Research Asia unveiled VASA-1, an AI model that can create a synchronized animated video of a person talking or singing from a single photo and an existing audio track. In the future, it could power virtual avatars that render locally and don’t require video feeds—or allow anyone with similar tools to...
![](https://kbin.life/media/cache/resolve/entry_thumb/76/f4/76f464d8972aa7c5e9738f2dee8acb91aae42c561bf52a7ed6cd079a7af2a5bc.jpg)