We introduce VIRAL (VIsual Representation ALignment), a simple regularization strategy that explicitly aligns intermediate visual features in MLLMs with representations from pretrained vision encoders ...
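As a rough illustration of what aligning intermediate visual features with a pretrained encoder's representations could look like, here is a minimal pure-Python sketch. It is not taken from the VIRAL work: the cosine-similarity form of the loss, the function names (`cosine_similarity`, `alignment_loss`, `total_loss`), the `weight` parameter, and the assumption that both feature sets are already projected to the same dimension are all hypothetical choices made for illustration.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    # Small epsilon avoids division by zero for all-zero vectors.
    return dot / (norm_u * norm_v + 1e-8)


def alignment_loss(mllm_feats, encoder_feats):
    """Mean (1 - cosine similarity) over visual tokens.

    mllm_feats:    per-token intermediate features from the MLLM
                   (assumed already projected to the encoder dimension).
    encoder_feats: per-token features from a frozen pretrained vision encoder.
    """
    if len(mllm_feats) != len(encoder_feats):
        raise ValueError("both inputs must contain the same number of tokens")
    sims = [cosine_similarity(h, z) for h, z in zip(mllm_feats, encoder_feats)]
    return 1.0 - sum(sims) / len(sims)


def total_loss(lm_loss, mllm_feats, encoder_feats, weight=0.5):
    """Language-modeling loss plus a weighted alignment regularizer.

    The default weight of 0.5 is an arbitrary placeholder, not a reported value.
    """
    return lm_loss + weight * alignment_loss(mllm_feats, encoder_feats)
```

In this sketch the regularizer is zero when the two feature sets point in the same direction for every token and grows as they diverge, so it only adds a penalty term to the usual training loss.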
I currently get an error when installing opencv-python on Python 3.12 for ARM. Is there a prebuilt wheel (.whl) file or another installation method available?