This repository contains the demo for the audio-to-video synchronisation network (SyncNet). This network can be used for audio-visual synchronisation tasks including:
- Removing temporal lags between the audio and visual streams in a video;
- Determining who is speaking amongst multiple faces in a video.
The model can be used for non-commercial research purposes under the Creative Commons Attribution License. Please cite the paper below if you make use of the software.
The following packages are required to run the SyncNet demo:
python (2.7.12)
pytorch (0.4.0)
numpy (1.14.3)
scipy (1.0.1)
opencv-python (3.4.0) - install via the opencv-contrib-python package
python_speech_features (0.6)
cuda (8.0)
ffmpeg (3.4.2)
In addition to the above, the following packages are required to run the full pipeline:
tensorflow (1.2, 1.4)
pyscenedetect (0.3.5) - does not work with version 0.4
The demo has been tested with the package versions shown above, but may also work on other versions.
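For convenience, one possible way to install the Python dependencies with pip is sketched below; the PyPI package names (torch, opencv-contrib-python, python_speech_features, scenedetect) and version pins are assumptions and may need adjusting for your environment, and CUDA and ffmpeg must be installed separately.
# Sketch only: Python packages for the demo (pin versions as needed, e.g. torch==0.4.0)
pip install torch numpy scipy opencv-contrib-python python_speech_features
# Additional packages for the full pipeline
pip install tensorflow scenedetect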
SyncNet demo:
python demo_syncnet.py --videofile data/example.avi --tmp_dir /path/to/temp/directory
Check that this script returns:
AV offset: 4
Min dist: 6.568
Confidence: 9.889
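The AV offset is reported in video frames; assuming 25 fps video, an offset of 4 frames corresponds to 4/25 = 0.16 s. One possible way to compensate for such a lag with ffmpeg is sketched below; the output file name is hypothetical, and the sign of the shift (whether the audio should be delayed or advanced) should be checked against your own results.
# Sketch only: delay the audio of example.avi by 0.16 s (4 frames at 25 fps) and remux
ffmpeg -i data/example.avi -itsoffset 0.16 -i data/example.avi -map 0:v -map 1:a -c copy data/example_shifted.avi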
Full pipeline:
sh download_model.sh
python run_pipeline.py --videofile /path/to/video.mp4 --reference name_of_video --data_dir /path/to/output
python run_syncnet.py --videofile /path/to/video.mp4 --reference name_of_video --data_dir /path/to/output
python run_visualise.py --videofile /path/to/video.mp4 --reference name_of_video --data_dir /path/to/output
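The three steps above can be chained over a directory of videos; the shell sketch below is one possible wrapper, with the input and output paths as placeholders and each video's base name used as its reference.
# Sketch only: run the full pipeline on every .mp4 in a directory
DATA_DIR=/path/to/output
for f in /path/to/videos/*.mp4; do
  ref=$(basename "$f" .mp4)
  python run_pipeline.py  --videofile "$f" --reference "$ref" --data_dir "$DATA_DIR"
  python run_syncnet.py   --videofile "$f" --reference "$ref" --data_dir "$DATA_DIR"
  python run_visualise.py --videofile "$f" --reference "$ref" --data_dir "$DATA_DIR"
done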
Outputs:
$DATA_DIR/pycrop/$REFERENCE/*.avi - cropped face tracks
$DATA_DIR/pywork/$REFERENCE/offsets.txt - audio-video offset values
$DATA_DIR/pyavi/$REFERENCE/video_out.avi - output video
Citation:
@InProceedings{Chung16a,
  author = "Chung, J.~S. and Zisserman, A.",
  title = "Out of time: automated lip sync in the wild",
  booktitle = "Workshop on Multi-view Lip-reading, ACCV",
  year = "2016",
}

