PDF Segmentator

Civil Society, Human Rights

Workflow Diagram

[Figure: PDF Segmentator workflow diagram]

Task

A user needs a PDF segmented, so an AI system segments it and returns an annotated PDF and a JSON file.

Intent

The AI system supports users in collecting information they need for different tasks.

AI Workflow

Input

A user submits a single PDF through a user interface (UI).

Process

Model 1 segments the PDF; its output feeds Model 2, which classifies the segments. The outputs are a JSON file and an annotated PDF.
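The two-stage process above can be sketched as follows. This is a minimal illustration, not the actual implementation: the `Segment` fields, the bounding-box convention, and the stub models are all hypothetical stand-ins, and rendering the annotated PDF is left out.

```python
import json
from dataclasses import dataclass, asdict
from typing import Callable, List, Tuple


@dataclass
class Segment:
    # Hypothetical schema: the real JSON layout is not given in the source.
    page: int
    bbox: Tuple[int, int, int, int]  # assumed (left, top, width, height)
    label: str = "unclassified"


def run_pipeline(
    pages: List[str],
    segment_model: Callable[[List[str]], List[Segment]],
    classify_model: Callable[[Segment], Segment],
) -> str:
    """Run Model 1, feed its segments to Model 2, return the JSON output."""
    segments = segment_model(pages)                      # Model 1: segmentation
    classified = [classify_model(s) for s in segments]   # Model 2: classification
    return json.dumps([asdict(s) for s in classified])


# Stub models standing in for the real ones (assumptions for illustration).
def stub_segmenter(pages: List[str]) -> List[Segment]:
    # Emits one full-page box per page.
    return [Segment(page=i, bbox=(0, 0, 100, 100)) for i, _ in enumerate(pages, start=1)]


def stub_classifier(segment: Segment) -> Segment:
    # Labels every segment as "text".
    return Segment(segment.page, segment.bbox, label="text")


result_json = run_pipeline(["page one", "page two"], stub_segmenter, stub_classifier)
```

Passing the models in as callables keeps the sketch independent of any particular model library.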

Output

The user checks the output. If satisfied with the annotated PDF, they accept it and use the JSON; otherwise, they reject it and do not use the JSON.
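The accept/reject step can be expressed as a simple gate in front of the JSON. The function name and signature below are hypothetical; the sketch only shows that nothing reaches downstream use without an explicit user decision.

```python
from typing import Optional


def review_output(segments_json: str, user_accepts: bool) -> Optional[str]:
    """Release the JSON only if the user accepted the annotated PDF.

    Hypothetical helper: returns the JSON on acceptance, None on rejection.
    """
    if user_accepts:
        return segments_json
    return None


accepted = review_output('[{"page": 1, "label": "text"}]', user_accepts=True)
rejected = review_output('[{"page": 1, "label": "text"}]', user_accepts=False)
```

Returning `None` on rejection forces any calling code to handle the rejected case explicitly rather than silently using unapproved output.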

Human Oversight Level

Human-Approved AI

Institutional Oversight Examples

  • Models should be pulled from a specific, pinned commit (ad-hoc practice)
  • Services that use the AI should have a release version (ad-hoc practice)
  • All benchmarks should be saved in a public repository (organization best practice)
  • Test sets should be used to verify that performance is maintained (best practice)
  • Implemented code should be open-sourced (organization policy)
  • Services should be covered with unit tests, integration tests, and end-to-end tests (organization policy)
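Two of the practices above, pinning a model to a commit and checking that performance is maintained, lend themselves to small automated checks. The sketch below is illustrative only: the helper names, the 40-character SHA-1 assumption, and the tolerance value are assumptions rather than anything specified by the organization.

```python
import re

# Assumes Git SHA-1 commit hashes: exactly 40 lowercase hex characters.
FULL_SHA = re.compile(r"[0-9a-f]{40}")


def is_pinned(revision: str) -> bool:
    """True if the revision is a full commit hash rather than a branch or tag."""
    return FULL_SHA.fullmatch(revision) is not None


def performance_maintained(current: float, baseline: float, tolerance: float = 0.01) -> bool:
    """True if the current test-set score has not dropped below the saved
    benchmark by more than the (hypothetical) tolerance."""
    return current >= baseline - tolerance
```

A check like `is_pinned` could run before a model is loaded, and `performance_maintained` could compare a fresh test-set score against the benchmark saved in the public repository.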

Risk

The AI system could make mistakes when annotating and segmenting the PDF.