Test Collection

Version: 9 (current) | Updated: 11/12/2025, 7:13:54 PM

Added description

Description

Newsletter Posts Dataset

Overview

The Newsletter Posts Dataset is a digital collection of metadata describing individual newsletter posts. It was created on 4 June 2023, is written in English, and is identified by the accession code 01K9WQHHP7SDSAWX6VNKNQC47C. The dataset is hosted by the “test” institution and was sourced from the PINAX platform. Its access URL is currently a placeholder, and it is categorized under the subjects “newsletters,” “email marketing,” and “content management.”

Background

The dataset was compiled as part of a test or pilot project by the test institution, intended to support research and operational analysis in email marketing and content management. The metadata record indicates that the dataset was generated by an automated system within PINAX, which harvested information from the institution’s newsletter platform (e.g., Substack) and organized it into a structured format for downstream use.

The dataset contains a series of records, each representing a single newsletter post. Key fields include:

Post ID – a unique identifier for the post
Title – the headline of the newsletter
Author – the creator or editor of the post
Publication Date – the date the newsletter was released
Content Length – the number of words or characters
Tags/Keywords – descriptors used for categorization
Subscriber Count – the number of recipients at the time of publication

The records are stored in a machine‑readable format (CSV or JSON) and provide only metadata; full‑text content is not included. The dataset also contains auxiliary fields such as the source platform, distribution channel, and engagement metrics (open and click rates).
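As a rough illustration of how the fields listed above might be read from a CSV export, the following Python sketch maps each row to a typed record. The column names, the semicolon-separated tag convention, the ISO date format, and the sample row are all assumptions made for this example; the record does not document the actual file layout.

```python
import csv
import io
from dataclasses import dataclass
from datetime import date


@dataclass
class NewsletterPost:
    """One metadata record; field names are assumed, not taken from the dataset."""
    post_id: str
    title: str
    author: str
    publication_date: date
    content_length: int
    tags: list[str]
    subscriber_count: int


# Hypothetical sample row; the real export's header names and values may differ.
SAMPLE_CSV = """post_id,title,author,publication_date,content_length,tags,subscriber_count
p-001,Welcome issue,A. Editor,2023-05-01,850,welcome;announcements,1200
"""


def load_posts(text: str) -> list[NewsletterPost]:
    """Parse CSV text into typed NewsletterPost records."""
    posts = []
    for row in csv.DictReader(io.StringIO(text)):
        posts.append(
            NewsletterPost(
                post_id=row["post_id"],
                title=row["title"],
                author=row["author"],
                # Assumes ISO 8601 dates (YYYY-MM-DD).
                publication_date=date.fromisoformat(row["publication_date"]),
                content_length=int(row["content_length"]),
                # Assumes tags are semicolon-separated within a single column.
                tags=[t for t in row["tags"].split(";") if t],
                subscriber_count=int(row["subscriber_count"]),
            )
        )
    return posts


posts = load_posts(SAMPLE_CSV)
```

Converting dates and counts at load time means malformed rows fail early rather than surfacing later during analysis.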

Scope

The dataset covers newsletter posts published by the test institution up to 4 June 2023. It is limited to English‑language content and excludes any non‑newsletter system files or archival artifacts. The collection focuses on metadata for analysis of content strategy, audience engagement, and publication workflows, rather than providing the full editorial text of the newsletters.

Raw Cheimarros Data

@file_pinax -> documents -> @test_collection:document {title: "Test Collection", type: "Collection", creator: "test", institution: "test", created: @date_2023, language: "en", description: "A collection of various digital archival items including Substack posts, newsletter datasets, and system files metadata.", source: "PINAX", rights: "Unknown"}

@newsletter:concept {description: "Periodic publication containing updates"}
@metadata:concept {description: "Data about other data"}
@digital_archives:concept {description: "Archives of digital materials"}
@editor_tutorial:concept {description: "Instructional material for using an editor"}
@promotion:concept {description: "Activities to increase visibility or sales"}
@subscription:concept {description: "Recurring payment for access"}
@modern_life:concept {description: "Contemporary lifestyle topics"}
@historical_essays:concept {description: "Essays about historical subjects"}
@formatting:concept {description: "Styling of text and media"}
@writing_tools:concept {description: "Software or utilities for writing"}
@email_marketing:concept {description: "Marketing via email"}
@content_management:concept {description: "Organizing and handling content"}
@system_files:concept {description: "Files related to system configuration"}

@test_collection -> has subject -> [@substack, @newsletter, @metadata, @digital_archives, @editor_tutorial, @promotion, @subscription, @modern_life, @historical_essays, @formatting, @writing_tools, @email_marketing, @content_management, @system_files]
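The subject relation above lines up with the Subjects field in the metadata. A minimal Python sketch of how that list could be derived from the relation line is shown here. The regular expression is inferred only from the single line shown in this record and is not a Cheimarros grammar; `parse_relation` and `to_label` are hypothetical helper names.

```python
import re

# Pattern inferred from the one relation line in this record (an assumption):
#   @source -> predicate -> [@target, @target, ...]
RELATION = re.compile(r"^@(\w+)\s*->\s*(.+?)\s*->\s*\[(.*)\]$")

LINE = (
    "@test_collection -> has subject -> [@substack, @newsletter, @metadata, "
    "@digital_archives, @editor_tutorial, @promotion, @subscription, "
    "@modern_life, @historical_essays, @formatting, @writing_tools, "
    "@email_marketing, @content_management, @system_files]"
)


def parse_relation(line: str) -> tuple[str, str, list[str]]:
    """Split a relation line into (source, predicate, target identifiers)."""
    match = RELATION.match(line.strip())
    if match is None:
        raise ValueError(f"not a recognized relation line: {line!r}")
    source, predicate, targets = match.groups()
    ids = [t.strip().lstrip("@") for t in targets.split(",") if t.strip()]
    return source, predicate, ids


def to_label(identifier: str) -> str:
    """Turn an identifier such as 'digital_archives' into 'Digital Archives'."""
    return identifier.replace("_", " ").title()


source, predicate, ids = parse_relation(LINE)
labels = [to_label(i) for i in ids]
```

Joining `labels` with ", " gives the same subject list that appears in the Metadata section.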

Metadata

Title: Test Collection
Type: Collection
Creator: test
Institution: test
Created: 2023
Language: en
Subjects: Substack, Newsletter, Metadata, Digital Archives, Editor Tutorial, Promotion, Subscription, Modern Life, Historical Essays, Formatting, Writing Tools, Email Marketing, Content Management, System Files
Rights: Unknown
Source: PINAX

Version History (9 versions)

✓ v9 (current) · 11/12/2025, 7:13:54 PM
"Added description"
v8 · 11/12/2025, 7:12:18 PM
"Added knowledge graph extraction"
v7 · 11/12/2025, 7:09:33 PM
"Added PINAX metadata"
v6 · 11/12/2025, 7:09:01 PM
"Reorganized into 2 groups"
v5 · 11/12/2025, 7:09:00 PM
"Added child entity 01K9WQH7XYG7NNN82G7FKZZ2P7"
v4 · 11/12/2025, 7:08:59 PM
"Added child entity 01K9WQH6KTDV11J00V41R5H0SP"
v3 · 11/12/2025, 7:08:47 PM
"Set parent to 01K9WQGRT6CVXW2Y0VWFZMK2XZ"
v2 · 11/12/2025, 7:08:46 PM
"Added children"
v1 · 11/12/2025, 7:08:44 PM
"Initial snapshot"

Additional Components

reorganization-description.txt

# Reorganization Summary

The organizational strategy separates content-related files from system-generated files. The 'newsletter_posts' group contains the CSV file with details about newsletter posts, which is logically distinct from the 'system_files' group that holds metadata files such as .DS_Store.ref.json. This separation keeps the groups clear and avoids duplicating files across them.

## Groups Created

- **newsletter_posts**: Files containing data about newsletter posts, including post IDs, dates, publication status, and audience.
- **system_files**: System-generated files, typically used for storing metadata or configurations.
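The two-group split could be sketched in Python roughly as follows. The classification rules (hidden dotfiles go to `system_files`, `.csv` files go to `newsletter_posts`) and the file name `posts.csv` are assumptions for illustration. Only `.DS_Store.ref.json` is named in the summary, and the actual reorganization logic is not documented here.

```python
from collections import defaultdict
from pathlib import PurePosixPath


def assign_group(filename: str) -> str:
    """Return the single group a file belongs to (assumed rules, see above)."""
    path = PurePosixPath(filename)
    if path.name.startswith("."):
        # Hidden dotfiles such as .DS_Store.ref.json are treated as system files.
        return "system_files"
    if path.suffix.lower() == ".csv":
        return "newsletter_posts"
    return "unassigned"


def group_files(filenames: list[str]) -> dict[str, list[str]]:
    """Place each file in exactly one group, so nothing is duplicated."""
    groups: dict[str, list[str]] = defaultdict(list)
    for name in filenames:
        groups[assign_group(name)].append(name)
    return dict(groups)


# 'posts.csv' is a hypothetical file name used only for this example.
groups = group_files([".DS_Store.ref.json", "posts.csv"])
```

Because `assign_group` returns one label per file, a file can never appear in both groups.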

Parent

↑ 01K9WQGRT6CVXW2Y0VWFZMK2XZ