SyncNet

This repository contains the demo for the audio-to-video synchronisation network (SyncNet). This network can be used for audio-visual synchronisation tasks including:

Removing temporal lags between the audio and visual streams in a video;
Determining who is speaking amongst multiple faces in a video.

Please cite the paper below if you make use of the software.

Dependencies

pip install -r requirements.txt

In addition, ffmpeg is required.

Demo

SyncNet demo:

python demo_syncnet.py --videofile data/example.avi --tmp_dir /path/to/temp/directory

Check that this script returns:

AV offset:      3 
Min dist:       5.353
Confidence:     10.021
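To compare these values programmatically rather than by eye, the three reported lines can be parsed into numbers. This is a minimal sketch that assumes the script prints exactly the `label: value` layout shown above; `parse_demo_output` and `SAMPLE_OUTPUT` are hypothetical names, not part of the repository.

```python
import re

# Sample text copied from the expected demo output shown above.
SAMPLE_OUTPUT = """\
AV offset:      3
Min dist:       5.353
Confidence:     10.021
"""

# One "label: number" pair per line; the layout is assumed from the sample.
_LINE = re.compile(
    r"^(AV offset|Min dist|Confidence):[ \t]*(-?\d+(?:\.\d+)?)[ \t]*$",
    re.MULTILINE,
)


def parse_demo_output(text: str) -> dict:
    """Map each reported label to its numeric value."""
    values = {label: float(number) for label, number in _LINE.findall(text)}
    missing = {"AV offset", "Min dist", "Confidence"} - values.keys()
    if missing:
        raise ValueError(f"missing fields: {sorted(missing)}")
    return values


result = parse_demo_output(SAMPLE_OUTPUT)
```

Lines that do not match one of the three labels are ignored, so any extra logging from the script should not break the parse.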

Full pipeline:

sh download_model.sh
python run_pipeline.py --videofile /path/to/video.mp4 --reference name_of_video --data_dir /path/to/output
python run_syncnet.py --videofile /path/to/video.mp4 --reference name_of_video --data_dir /path/to/output
python run_visualise.py --videofile /path/to/video.mp4 --reference name_of_video --data_dir /path/to/output
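The three stages take the same arguments, so they can also be driven from one Python script. The sketch below only rebuilds the command lines listed above and runs them in order; `build_commands` and `run_all` are hypothetical helpers, and it assumes the model has already been downloaded with `download_model.sh`.

```python
import subprocess
import sys

# Stage scripts in the order given above.
STAGES = ("run_pipeline.py", "run_syncnet.py", "run_visualise.py")


def build_commands(videofile: str, reference: str, data_dir: str) -> list:
    """Return one argv list per stage, all sharing the same arguments."""
    shared = [
        "--videofile", videofile,
        "--reference", reference,
        "--data_dir", data_dir,
    ]
    return [[sys.executable, script, *shared] for script in STAGES]


def run_all(videofile: str, reference: str, data_dir: str) -> None:
    """Run the stages in order, stopping at the first one that fails."""
    for command in build_commands(videofile, reference, data_dir):
        subprocess.run(command, check=True)
```

Calling `run_all("/path/to/video.mp4", "name_of_video", "/path/to/output")` from the repository root would then be equivalent to the three commands above.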

Outputs:

$DATA_DIR/pycrop/$REFERENCE/*.avi - cropped face tracks
$DATA_DIR/pywork/$REFERENCE/offsets.txt - audio-video offset values
$DATA_DIR/pyavi/$REFERENCE/video_out.avi - output video
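The output locations follow directly from the `--data_dir` and `--reference` arguments. A small sketch for collecting them, where `output_paths` is a hypothetical helper and the contents of `offsets.txt` are not interpreted because its format is not described here:

```python
from pathlib import Path


def output_paths(data_dir: str, reference: str) -> dict:
    """Collect the pipeline outputs for one reference name."""
    root = Path(data_dir)
    return {
        # Cropped face tracks; an empty list if nothing has been written yet.
        "face_tracks": sorted((root / "pycrop" / reference).glob("*.avi")),
        # Audio-video offset values.
        "offsets": root / "pywork" / reference / "offsets.txt",
        # Output video.
        "video": root / "pyavi" / reference / "video_out.avi",
    }
```

For example, `output_paths("/path/to/output", "name_of_video")["offsets"]` gives the path to the offsets file for that run.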

Device Support

This implementation supports both CUDA GPU and CPU execution:

CUDA GPU: Automatically detected and used if available for faster processing
CPU: Used as fallback when CUDA is not available, or can be forced for compatibility

Device Selection

The code automatically detects and uses the best available device:

If CUDA is available → Uses GPU for acceleration
If CUDA is not available → Falls back to CPU
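The fallback rule above can be sketched as follows. This is an illustration only: `select_device` is a hypothetical name and is not claimed to match the repository's own `device_utils.py`. It relies on PyTorch's `torch.cuda.is_available()` and treats a missing PyTorch install as "no GPU".

```python
def select_device(force_cpu: bool = False) -> str:
    """Return "cuda" when a CUDA GPU is usable, otherwise "cpu"."""
    if force_cpu:
        return "cpu"
    try:
        import torch  # imported lazily so the function works without PyTorch
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


device = select_device()
```

The returned string can be passed to `torch.device(...)` or to `.to(device)` on a model or tensor.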

CPU-Only Execution

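To force CPU-only execution (e.g., for compatibility or debugging), hide all GPUs from CUDA at the top of the script, before PyTorch initialises CUDA:

import os
# Must run before CUDA is initialised; safest is before `import torch`.
os.environ['CUDA_VISIBLE_DEVICES'] = ''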

Alternatively, modify the device selection in the scripts directly.
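If editing the scripts is not desirable, the same environment variable can be set for a child process only. A minimal sketch, with `cpu_only_env` and `run_cpu_only` as hypothetical helper names:

```python
import os
import subprocess
import sys


def cpu_only_env(base=None) -> dict:
    """Copy an environment and hide all CUDA devices in the copy."""
    env = dict(os.environ if base is None else base)
    env["CUDA_VISIBLE_DEVICES"] = ""
    return env


def run_cpu_only(argv: list) -> subprocess.CompletedProcess:
    """Run a command with GPUs hidden; the parent environment is unchanged."""
    return subprocess.run(argv, env=cpu_only_env(), check=True)
```

For example, `run_cpu_only([sys.executable, "demo_syncnet.py", "--videofile", "data/example.avi", "--tmp_dir", "/path/to/temp/directory"])` would run the demo on the CPU without modifying any script.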

Publications

@InProceedings{Chung16a,
  author       = "Chung, J.~S. and Zisserman, A.",
  title        = "Out of time: automated lip sync in the wild",
  booktitle    = "Workshop on Multi-view Lip-reading, ACCV",
  year         = "2016",
}
