StreamYard On-Air

How to Build Multimodal Document RAG with Llama 3.2 Vision and ColQwen2

In this event, we'll discuss how you can perform RAG over complex PDF documents that contain images, graphs, tables, text, charts, and more! We'll describe in detail:
- How the new image retriever ColPali works
- How you can fine-tune ColPali to improve it further for your use case
- How to leverage multi-vector retrieval to retrieve from PDFs
- How to use vision language models like the new Llama 3.2 Vision series to perform document RAG
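As a rough illustration of the multi-vector retrieval idea mentioned above, here is a minimal sketch of late-interaction (MaxSim) scoring, the ColBERT-style scheme that ColPali-type retrievers are generally described as using. It assumes each query and each page has already been encoded into a matrix of L2-normalized vectors (one per query token or image patch). The function names `maxsim_score` and `rank_pages` are hypothetical and are not part of any library API.

```python
import numpy as np


def maxsim_score(query_vecs: np.ndarray, page_vecs: np.ndarray) -> float:
    """Late-interaction score between one query and one page.

    query_vecs: (num_query_tokens, dim) array, assumed L2-normalized.
    page_vecs:  (num_page_patches, dim) array, assumed L2-normalized.
    """
    # Similarity of every query vector against every page vector.
    sims = query_vecs @ page_vecs.T
    # For each query vector keep its best-matching page vector, then sum.
    return float(sims.max(axis=1).sum())


def rank_pages(query_vecs: np.ndarray, pages: list) -> list:
    """Return page indices sorted from best to worst MaxSim score."""
    scores = [maxsim_score(query_vecs, p) for p in pages]
    return sorted(range(len(pages)), key=lambda i: scores[i], reverse=True)
```

In this sketch, the top-ranked page images would then be passed, together with the question, to a vision language model to generate the answer.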
