Question 1

What makes a document extraction auditable?

Accepted Answer

An auditable extraction keeps the answer connected to the source document. Ninjadoc returns page indexes, bounding boxes, evidence text, citation URLs, and cropped citation URLs so reviewers can verify where each value came from.

Question 2

Are citation URLs different from page numbers?

Accepted Answer

Yes. Page numbers tell a reviewer where to look. Citation URLs give the application or agent a direct source artifact that can be opened, embedded, or stored in an audit log.

Question 3

Why do bounding boxes matter?

Accepted Answer

Bounding boxes identify the exact source region for an extracted value. They let your product highlight, crop, review, and explain the original evidence instead of asking a user to search the whole document.

Question 4

Can agents use this evidence downstream?

Accepted Answer

Yes. The evidence object is returned as structured JSON and can be passed through APIs, databases, review queues, and MCP-compatible agent tools.

Every extracted answer comes with proof.

Answer

Evidence text

Bounding box

Citation URL

AI outputs become usable when reviewers can verify them.

Use it when the output needs review

What makes a document extraction auditable?

Are citation URLs different from page numbers?

Why do bounding boxes matter?

Can agents use this evidence downstream?

Build the audit trail into extraction.