z-news.link

The neural network is trained to create talking heads

The developers have created a system that long-term study on a large set of video data.

Training convolutional neural networks, the Russian developers of Samsung and the SKOLKOVO Institute of science and technology (Skoltech) animated photography, portraits and paintings.

It is known that to synthesize realistic avatars is difficult for two reasons. First, the human head has a high photometric, geometric and kinematic complexity: difficulties arise not only in the process of face modeling, but the mouth, hair and clothes. Another complicating factor is the acuity of the human visual system, which results in the effect of “sinister valley.” According to the hypothesis, if the robot makes mistakes in trying to emulate someone causes an uncontrollable disgust of people-observers.

To create a personalized model of a talking head with artificial intelligence requires training on a large set of images of the hero. However, in many applications, such models must be obtained from multiple images of a person, perhaps even from one. The developers have created a system that long-term study on a large set of video data and generates the mask of the speaker face. The mask represents the border of the face and the basic facial expressions. The relationship resulting mask with the source video is stored in a vector, so the mask can be transferred to individual images.

In the process of metalocene neural network automatiseret the process of selecting and configuring components. Three models were trained on a large database of video interviews with celebrities, found in the vast Youtube. Network-embedder transformed masks, coupled with the characteristics of the person, in vectors. These vectors were used to initialize the network settings generator. And network generator, in turn, has formed a video which the network discriminator was compared with the original and appreciated the realism of the result.

The system was tested by applying as the lead of the video the video with the front camera, as well as images which is carried out transfer — selfie-photos. 32 images sufficient to obtain high-quality “talking head”.

paradox

Next Samsung has shown a camera module with 5x optical zoom »

Previous « Sony will not close the unprofitable unit for the production of smartphones

Published by

paradox

Tags: createheadsnetworkneuraltalkingtrained

5 years ago

Web Scraping
Web Scraping is a popular method of getting content from almost nothing. Some specialists call…
Дизельная электростанция 500 кВт
Централизованная система энергоснабжения не исключает различного рода отклонений от нормального режима работы, результатом которых зачастую…
3D принтеры и экономика мира
Аддитивные технологии становятся все более популярными среди глобальных брендов, количество которых сегодня уже превысило 30%.…

Much of Ukraine aid stolen – French party leader

A large part of Western aid to Kiev is being embezzled by Ukrainian officials, despite…

5 hours ago

WORLD

Drone raid on Russian energy infrastructure repelled

Russia repelled a wave of attempted Ukrainian drones strikes on oil refineries and energy infrastructure…

14 hours ago

WORLD

Blinken in Beijing: The US tried to turn China against Russia – but did it work?

Antony Blinken traveled to China this week to warn Beijing about sanctions for supplying military…

23 hours ago

WORLD

US created Ukraine conflict – Shoigu

The Ukraine conflict is Washington’s doing and the US is deliberately trying to prolong the…

1 day ago

WORLD

Pentagon unveils targets for ATACMS missiles secretly shipped to Ukraine – NYT

The US-supplied Army Tactical Missile Systems, known as ATACMS, will allow Ukrainian forces to target…

2 days ago

WORLD

President admits hugging nukes

Belarusian President Alexander Lukashenko has revealed he once got up-close and personal with a “strategic…