Focused on the VTP (Visual Tokenizer) backbone for high-fidelity generation.
vision, architecture, research