Abstract: Aim: The COVID-19 virus has made wearing masks a habit for people. The objective of this project is to provide a system with better accuracy for masked facial identification using the ...
Abstract: Visual grounding aims to ground an image region through natural language, a task that relies heavily on cross-modal alignment. Most existing methods transfer visual/linguistic knowledge separately ...