FlipVQA-Miner: Multimodal Knowledge Extraction

Multimodal Knowledge Extraction Pipeline Demo

Upload textbook or exam PDFs. MinerU parses the layout and an LLM extracts structured QA pairs, outputting raw_vqa.jsonl.

Pipeline: PDF Upload → MinerU Parsing → LLM QA Extraction → Download Results

All API calls use your own keys. This Space does not store any data or keys.

📄 Upload PDF

📋 Example PDFs (click to load)

⚙️ LLM Configuration

🏗️ MinerU Configuration

1 30

📤 Output

Result Preview