BIF-Face

synthetic face recognition dataset

Get Data


Description

Training face recognition models requires a large amount of identity-labeled face images, which are often collected by crawling the web, and therefore have ethical and privacy concerns. Recently, generating synthetic face datasets and training face recognition models using synthetic datasets has emerged to be a viable solution. We present BIF-Face, a new synthetic face recognition dataset. We use the Brownian identity diffusion to generate synthetic identities, and then build synthetic face recognition datasets by generating different samples per each identity using a foundation model. Our experimental results show that face recognition models trained with BIF-Face achieve competitive performance with face recognition models trained on state-of-the-art synthetic face recognition datasets.

Reference

 

If you use this dataset, please cite the following publication:

@inproceedings{shahreza2025generating,
  title={Generating Synthetic Face Recognition Datasets Using Brownian Identity Diffusion and a Foundation Model},
  author={Shahreza, Hatef Otroshi and Marcel, S{\'e}bastien},
  booktitle={2025 IEEE 35th International Workshop on Machine Learning for Signal Processing (MLSP)},
  pages={1--6},
  year={2025},
  organization={IEEE}
}