feature-image
eng

Pietro Montino

AI Principal Consultant at Ennova Research

Automatic unstructured PDF ingestion on GCP

The scope of this session is to show how it is possible to create a Question Answering agent on GCP starting from complex documents, like technical handbooks. This will be done by using the following passages:

1. Document Segmentation
Semantic structures (like title, sections, tables...) will be inferred with State-of-the-Art models.

2. Optical Character Recognition (OCR)
Google OCR will be applied to the previously-determined structures in order to determine their content.

3. Vectorization
The text will be saved in a vector store in order to allow a fast information retrieval.

4. Large Language Model (LLM) Connection
The vector store will be connected to an LLM that will be used to provide the user with suitable answers for his/her questions.

This project will showcase the capabilities of State-of-the-Art models and technologies in the field of Generative AI.

Speaker Bio:

Helping companies accelerate growth and reduce costs through AI.