VOCASET

Name: VOCASET
Published: 2019-05-08
License: Unknown

Dataset Information

Modalities: 3D, Speech
Introduced: 2019
License: Unknown
Homepage: Official Website

Contents

Overview
Associated Benchmarks
Recent Benchmark Submissions
Research Papers

Overview

VOCASET is a 4D face dataset containing about 29 minutes of 4D scans captured at 60 fps, together with synchronized audio. It covers 12 subjects and 480 sequences of about 3-4 seconds each, with sentences chosen from an array of standard protocols that maximize phonetic diversity.
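The figures quoted above can be cross-checked with a little arithmetic. The sketch below is only a back-of-the-envelope calculation from the numbers in the overview. The variable names are illustrative, and the even split of sequences across subjects is an assumption, not something the overview states.

```python
# Back-of-the-envelope check of the figures quoted in the overview.
TOTAL_MINUTES = 29           # "about 29 minutes" of 4D scans
FPS = 60                     # capture rate
NUM_SEQUENCES = 480
NUM_SUBJECTS = 12
SEQ_SECONDS_RANGE = (3, 4)   # "about 3-4 seconds each"

total_seconds = TOTAL_MINUTES * 60

# Approximate number of captured frames implied by duration and frame rate.
approx_frames = total_seconds * FPS

# Mean sequence length implied by the total duration.
mean_seq_seconds = total_seconds / NUM_SEQUENCES

# ASSUMPTION: sequences are distributed evenly across subjects.
seqs_per_subject = NUM_SEQUENCES // NUM_SUBJECTS

# Total duration (in minutes) implied by the per-sequence length range.
implied_minutes = tuple(NUM_SEQUENCES * s / 60 for s in SEQ_SECONDS_RANGE)

print("approx. frames:", approx_frames)
print("mean sequence length (s):", mean_seq_seconds)
print("sequences per subject (if evenly split):", seqs_per_subject)
print("duration implied by 3-4 s sequences (min):", implied_minutes)
```

The implied mean sequence length falls inside the stated 3-4 second range, and the stated 29 minutes falls inside the duration implied by 480 sequences of that length, so the quoted figures are mutually consistent.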

Source: timzhang642

Variants: VOCASET

Associated Benchmarks

This dataset is used in 1 benchmark:

3D Face Animation - Metrics: Lip Vertex Error
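This page does not define Lip Vertex Error. As a minimal sketch, the code below assumes one formulation commonly used for speech-driven animation: for each frame, take the largest L2 distance between predicted and ground-truth positions over the lip vertices, then average over frames. The function name `lip_vertex_error`, the data layout, and the lip vertex indices are hypothetical. Published implementations may differ (for example, by using squared distances), so treat this as an illustration and not as the benchmark's official scoring code.

```python
import math


def lip_vertex_error(pred, target, lip_indices):
    """Mean over frames of the max L2 error over lip vertices.

    pred, target: list of frames; each frame is a list of (x, y, z) vertices.
    lip_indices: indices of the vertices treated as the lip region
                 (hypothetical here; a real mesh topology defines its own).
    """
    if len(pred) != len(target) or not pred:
        raise ValueError("pred and target need the same, non-zero frame count")

    per_frame_max = []
    for pred_frame, target_frame in zip(pred, target):
        # Largest Euclidean distance among the lip vertices in this frame.
        worst = max(
            math.dist(pred_frame[i], target_frame[i]) for i in lip_indices
        )
        per_frame_max.append(worst)

    return sum(per_frame_max) / len(per_frame_max)


# Toy example: 2 frames, 3 vertices each; vertices 0 and 2 play the role of lips.
target = [
    [(0, 0, 0), (1, 0, 0), (0, 1, 0)],
    [(0, 0, 0), (1, 0, 0), (0, 1, 0)],
]
pred = [
    [(0, 0, 3), (9, 9, 9), (0, 1, 0)],  # the large error on vertex 1 is ignored
    [(0, 0, 0), (1, 0, 0), (0, 5, 0)],
]
error = lip_vertex_error(pred, target, lip_indices=[0, 2])
print(error)
```

Taking the per-frame maximum rather than the mean makes the score sensitive to a single badly placed lip vertex, which is the usual reason given for this kind of metric.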

Recent Benchmark Submissions

Task	Model	Paper	Date
3D Face Animation	FaceFormer	FaceFormer: Speech-Driven 3D Facial Animation …	2021-12-10
3D Face Animation	MeshTalk	MeshTalk: 3D Face Animation from …	2021-04-16

Research Papers

Recent papers with results on this dataset:

External Links:

VOCASET
