Test Collection

Version: 9 (current) | Updated: 11/12/2025, 2:54:42 PM

Added description

Description

Test Collection

Overview

The Test Collection is a digital archive catalogued as a Collection in the PINAX system, with a recorded creation date of 2017. It is written in English and preserves a variety of written material, including tutorials, announcements, essays, and HTML files. The collection also contains a structured dataset of newsletter posts and metadata for system files generated by the Arke Institute’s preservation workflow.

Background

The collection record lists four creators: the Arke Institute, the publishing platform Substack, the individual or persona Nchimicles, and one unidentified contributor. It is held by the institution labeled “test” and was created for archival or testing purposes within the PINAX environment. Rights are recorded as unknown, and the collection’s provenance is linked to the Arke Institute’s digital preservation activities.

Contents

  • Posts: A set of written posts covering tutorials, announcements, essays, and HTML documents.
  • Newsletter Posts Dataset: A JSON metadata file (`pinax.json`) and a CSV file (`filepostscsv`) that provide structured information about newsletter posts, primarily from Substack.
  • System Files Metadata: A textual metadata record, a JSON representation of that metadata, and a binary `.DS_Store` file, a macOS Finder artifact that stores folder display settings rather than user content.

The subjects associated with the collection include “tutorial,” “guide,” “Substack editor,” “formatting,” “text editing,” “announcements,” “updates,” “subscriptions,” “indulgence,” “modern lifestyle,” “essay,” “HTML,” “Web Content,” “Empty Files,” “newsletters,” “content management,” “metadata,” “digital preservation,” and “system files.”

Scope

The collection covers material created in 2017 (posts) and 2023 (system files metadata), all in English. It focuses on textual content and metadata related to online publishing and digital preservation. The collection does not include audio, video, or other media types. It is intended for researchers or archivists interested in the workflow of Substack-based newsletters, content‑management practices, and the preservation of digital assets.

Raw Cheimarros Data

@file_pinax -> documents -> @test_collection:document {title: "Test Collection", created: @date_2017, description: "A collection of posts including tutorials, announcements, essays, and HTML files, along with a dataset of newsletter posts and system files metadata."} {source: "PINAX", rights: "Unknown", access_url: "PLACEHOLDER"}

@test_collection -> creator -> [@arke_institute, @substack:organization {full_name: "Substack"}, @nchimicles, @unknown:person {description: "Unidentified creator"}]

@test_collection -> subject -> [@tutorial, @guide, @substack_editor, @formatting, @text_editing, @announcements, @updates, @subscriptions, @indulgence, @modern_lifestyle, @essay, @html, @web_content, @empty_files, @newsletters, @content_management, @metadata, @digital_preservation, @system_files]
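
The relation lines above appear to follow a `@subject -> predicate -> target` shape, where the target is either a single entity or a bracketed list, optionally with `:type` suffixes and `{...}` property blocks. The following is a minimal sketch of reading such lines into triples. The grammar is inferred from the sample lines only, not from any Cheimarros specification, and `parse_relation` is a hypothetical helper name.

```python
import re

# Assumed line shape: "@subject -> predicate -> target". This grammar is
# inferred from the sample data and is not an official Cheimarros parser.
LINE_RE = re.compile(r"^@(\w+)\s*->\s*(\w+)\s*->\s*(.+)$")
BRACES_RE = re.compile(r"\{[^{}]*\}")  # inline property blocks
ENTITY_RE = re.compile(r"@(\w+)")      # entity identifiers, without ":type"


def parse_relation(line):
    """Return (subject, predicate, object) triples for one relation line."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return []
    subject, predicate, target = match.groups()
    # Drop property blocks so references inside them (such as
    # "created: @date_2017") are not mistaken for relation targets.
    target = BRACES_RE.sub("", target)
    return [(subject, predicate, obj) for obj in ENTITY_RE.findall(target)]


sample = (
    '@test_collection -> creator -> [@arke_institute, '
    '@substack:organization {full_name: "Substack"}, @nchimicles, '
    '@unknown:person {description: "Unidentified creator"}]'
)
for triple in parse_relation(sample):
    print(triple)
```

The sketch discards type suffixes and property values. A fuller reader would need to keep them, and would need to handle braces that appear inside quoted strings, which this regular expression does not.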

Metadata

Version History (9 versions)

  • ✓ v9 (current) · 11/12/2025, 2:54:42 PM
    "Added description"
  • v8 · 11/12/2025, 2:53:50 PM
    "Added knowledge graph extraction"
  • v7 · 11/12/2025, 2:51:38 PM
    "Added PINAX metadata"
  • v6 · 11/12/2025, 2:50:53 PM
    "Reorganized into 2 groups"
  • v5 · 11/12/2025, 2:50:52 PM
    "Added child entity 01K9W8RJYMXWNH5QE811ZYZWMC"
  • v4 · 11/12/2025, 2:50:51 PM
    "Added child entity 01K9W8RHQD9HVTNWW8JKD9KFQX"
  • v3 · 11/12/2025, 2:50:38 PM
    "Set parent to 01K9W8R37SEN45JDVE5KG93V79"
  • v2 · 11/12/2025, 2:50:38 PM
    "Added children"
  • v1 · 11/12/2025, 2:50:36 PM
    "Initial snapshot"

Additional Components

reorganization-description.txt
# Reorganization Summary

The organizational strategy focuses on separating user content from system files. The 'newsletter_posts' group contains the CSV file with newsletter post data, while the 'system_files' group includes the reference file for a system-generated file. This approach ensures that content is logically grouped based on its purpose and origin, avoiding unnecessary meta-categories.

## Groups Created

- **newsletter_posts**: Files containing data related to newsletter posts, including post metadata such as publication status, audience, and titles.
- **system_files**: System-related files that do not contain user content but are necessary for system operations.

Parent

01K9W8R37SEN45JDVE5KG93V79

Children (3)