TY - GEN
T1 - A study of contrastive self-supervised learning generalization based on augmented data
AU - Zhang, Jiarui
AU - Wang, Tian
AU - Wang, Jian
AU - Li, Ce
AU - Fu, Yao
AU - Soussi, Hichem
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Contrastive learning have been widely applied recently, since only with unlabeled data, it produced results comparable to the state-of-the-art supervised algorithm. Data augmentation plays an essential role in defining effective predictive task, influencing generalization of the model. We find that high-intensity data augmentation is conducive to achieving a wider aggregation of intra-class samples, obtaining more effective features to improve generalization; however, extremely strong augmentation leads to the information loss of different classes, resulting in cross-class deviation of samples, reducing model generalization. We demonstrate our finding through experiment on CIFAR-10 dataset, analyzing the impact of different data augmentation operations on recognition results and the corresponding reasons. Our experimental verification on the relationship between the data augmentation and the generalization of contrastive self-supervised learning agree with our theoretical description.
AB - Contrastive learning have been widely applied recently, since only with unlabeled data, it produced results comparable to the state-of-the-art supervised algorithm. Data augmentation plays an essential role in defining effective predictive task, influencing generalization of the model. We find that high-intensity data augmentation is conducive to achieving a wider aggregation of intra-class samples, obtaining more effective features to improve generalization; however, extremely strong augmentation leads to the information loss of different classes, resulting in cross-class deviation of samples, reducing model generalization. We demonstrate our finding through experiment on CIFAR-10 dataset, analyzing the impact of different data augmentation operations on recognition results and the corresponding reasons. Our experimental verification on the relationship between the data augmentation and the generalization of contrastive self-supervised learning agree with our theoretical description.
KW - contrastive learning
KW - data augmentation
KW - generalization
KW - self-supervised learning
UR - https://www.scopus.com/pages/publications/85185571036
U2 - 10.1109/YAC59482.2023.10401602
DO - 10.1109/YAC59482.2023.10401602
M3 - 会议稿件
AN - SCOPUS:85185571036
T3 - Proceedings - 2023 38th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2023
SP - 659
EP - 664
BT - Proceedings - 2023 38th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 38th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2023
Y2 - 27 August 2023 through 29 August 2023
ER -