file

Test JPEG for KG Basic

01KJTN2S6NESHSE1R14QPVMAXX

Content

test.jpeg

key
test.jpeg
cid
bafkreifhjw43apnzrccsyufncn7pph5vpgzlyahg2owhncuiiem7kota5u
content_type
image/jpeg
size
1.4 MB (1,452,844 bytes)
uploaded_at
2026-03-03T20:08:57.596Z

Properties

ocr_images_extracted
1
ocr_model
mistral-ocr-latest
ocr_source_file_key
test.jpeg
text
# Meet Emma Emma runs a small regional archive. She has thousands of digitized documents—letters, diaries, local government records—but they sit in folders with minimal metadata. No descriptions. No transcriptions. Her three-person team doesn't have the capacity to catalog them properly, let alone make them searchable. The materials technically exist digitally, but they're practically invisible. Researchers can't find them. Search engines can't index them. AI systems don't know they exist. And if her institution's server fails, decades of local history could disappear. ![img-0.jpeg](arke:01KJTN34CQ184G3GMYT48QVZ8D)
text_extracted_at
2026-03-03T20:09:08.979Z
text_source
ocr

Relationships

  • has_extracted01KJTN34CQ184G3GMYT48QVZ8Dfile
  • extracted_entityletters
    entity_type
    document_type
    extracted_at
    2026-03-03T20:09:29.974Z
  • extracted_entitylocal government records
    entity_type
    document_type
    extracted_at
    2026-03-03T20:09:29.974Z
  • extracted_entityemma
    entity_type
    person
    extracted_at
    2026-03-03T20:09:29.974Z
  • extracted_entityregional archive
    entity_type
    organization
    extracted_at
    2026-03-03T20:09:29.974Z
  • extracted_entityemmas team
    entity_type
    group
    extracted_at
    2026-03-03T20:09:29.974Z
  • extracted_entitymetadata
    entity_type
    concept
    extracted_at
    2026-03-03T20:09:29.974Z
  • extracted_entitydiaries
    entity_type
    document_type
    extracted_at
    2026-03-03T20:09:29.974Z
  • extracted_entitydigitized documents
    entity_type
    document_collection
    extracted_at
    2026-03-03T20:09:29.974Z
  • extracted_entityresearchers
    entity_type
    group
    extracted_at
    2026-03-03T20:09:29.974Z
  • extracted_entityinstitutions server
    entity_type
    infrastructure
    extracted_at
    2026-03-03T20:09:29.974Z
  • extracted_entitysearch engines
    entity_type
    system_type
    extracted_at
    2026-03-03T20:09:29.974Z
  • extracted_entityai systems
    entity_type
    system_type
    extracted_at
    2026-03-03T20:09:29.974Z
  • extracted_entitylocal history
    entity_type
    concept
    extracted_at
    2026-03-03T20:09:29.974Z