Abstract
Despite recent successes, most contrastive self-supervised learning methods are domain-specific, relying heavily on data augmentation techniques that require knowledge about a particular domain, such as image cropping and rotation. To overcome such limitation, we propose a domain-agnostic approach to contrastive learning, named DACL, that is applicable to problems where domain-specific data augmentations are not readily available. Key to our approach is the use of Mixup noise to create similar and dissimilar examples by mixing data samples differently either at the input or hidden-state levels. We theoretically analyze our method and show advantages over the Gaussian-noise based contrastive learning approach. To demonstrate the effectiveness of DACL, we conduct experiments across various domains such as tabular data, images, and graphs. Our results show that DACL not only outperforms other domain-agnostic noising methods, such as Gaussian-noise, but also combines well with domain-specific methods, such as SimCLR, to improve self-supervised visual representation learning.
Original language | English |
---|---|
Title of host publication | Proceedings of the 38 th International Conference on Machine Learning |
Publisher | JMLR |
Number of pages | 12 |
Publication status | Published - 2021 |
MoE publication type | A4 Conference publication |
Event | International Conference on Machine Learning - Virtual, Online Duration: 18 Jul 2021 → 24 Jul 2021 Conference number: 38 |
Publication series
Name | Proceedings of Machine Learning Research |
---|---|
Publisher | PMLR |
Volume | 139 |
ISSN (Electronic) | 2640-3498 |
Conference
Conference | International Conference on Machine Learning |
---|---|
Abbreviated title | ICML |
City | Virtual, Online |
Period | 18/07/2021 → 24/07/2021 |