chanmuzi
<Multi-modal> FLAVA: A Foundational Language And Vision Alignment Model