Vox-adv-cpk.pth.tar

Classification: Deep Learning Model Checkpoint
Primary Architecture: First Order Motion Model (FOMM)
Primary Application: Image Animation / Face Re-enactment
Framework: PyTorch


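import torch
from demo import load_checkpoints  # demo.py from the first-order-model repository

# cpu=False loads the models onto the GPU; pass cpu=True to run without CUDA
generator, kp_detector = load_checkpoints(
    config_path='config/vox-256.yaml',
    checkpoint_path='vox-adv-cpk.pth.tar',
    cpu=False,
)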

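The following Python pseudocode demonstrates loading the file and running a forward pass:

import torch
from demo import load_checkpoints, make_animation  # first-order-model repository

generator, kp_detector = load_checkpoints(config_path='config/vox-256.yaml', checkpoint_path='vox-adv-cpk.pth.tar')
predictions = make_animation(source_image, driving_video, generator, kp_detector)  # source_image, driving_video: 256x256 RGB frames scaled to [0, 1]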

The file Vox-adv-cpk.pth.tar is a pre-trained weight checkpoint used primarily in computer vision for facial re-enactment. It lets a user animate a static image of a person (the "source") using the facial expressions and head movements from a driving video (the "driver").

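This specific checkpoint comes from the research presented in the paper "First Order Motion Model for Image Animation" (Siarohin et al., 2019). The naming convention Vox-adv-cpk breaks down as follows:

Vox: the VoxCeleb dataset the model was trained on
adv: adversarial, indicating a variant trained with an additional adversarial (discriminator) loss
cpk: checkpoint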

File Type: PyTorch Serialized Checkpoint (Model Weights)
Primary Association: First Order Motion Model for Image Animation
Architecture Origin: NeurIPS 2019 (Paper: "First Order Motion Model for Image Animation" by Siarohin et al.)
Dataset Origin: VoxCeleb Dataset
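
Because the file is a PyTorch serialized checkpoint, one way to see what it contains is to load it on the CPU and count the parameters stored under each top-level key. The following is a minimal sketch rather than part of the original project. The helper names count_parameters, summarize_checkpoint, and load_and_summarize are hypothetical, and the checkpoint is assumed to be a dictionary whose 'generator' and 'kp_detector' entries hold state dicts; nothing is assumed about any other keys.

```python
from math import prod


def count_parameters(state_dict: dict) -> int:
    """Sum element counts over every entry that exposes a `.shape`."""
    total = 0
    for value in state_dict.values():
        shape = getattr(value, "shape", None)
        if shape is not None:
            # torch.Size behaves like a tuple, so prod() works on it;
            # prod(()) == 1 covers scalar tensors.
            total += prod(shape)
    return total


def summarize_checkpoint(checkpoint: dict) -> dict:
    """Map each top-level key that holds a dict to its parameter count.

    Non-dict entries (for example an integer counter) are skipped.
    """
    return {
        key: count_parameters(value)
        for key, value in checkpoint.items()
        if isinstance(value, dict)
    }


def load_and_summarize(path: str) -> dict:
    """Load a checkpoint file on the CPU and summarize its components."""
    import torch  # imported lazily so the helpers above work without PyTorch

    # map_location="cpu" avoids needing a GPU just to inspect the weights.
    checkpoint = torch.load(path, map_location="cpu")
    return summarize_checkpoint(checkpoint)
```

Calling load_and_summarize('vox-adv-cpk.pth.tar') would return a dictionary of parameter counts per component. The actual numbers depend on the file and are not reproduced here.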