- description
- # Contents
## Overview
This entity is a **table of contents (ToC)** extracted from the text file `tom_sawyer.txt`, containing the chapter listing and illustrations for the novel *The Adventures of Tom Sawyer* by Mark Twain. It spans lines 38 to 460 in the source file and was automatically extracted on January 28, 2026, as part of a structural analysis process. The ToC is divided into two textual chunks for processing and is part of the [Test Collection](arke:01KG2T49K0H5GDRB0G4YDTPG8H).
## Context
The table of contents appears in sequence after the [Project Gutenberg eBook Information](arke:01KG2TRBFWYFR7HBQ0DTZAKJNW) and directly precedes the [Preface](arke:01KG2TRBF6C7EX6WP1HMZK6YKN) in the digital structure of the book. It is embedded within the full text of [The Adventures of Tom Sawyer](arke:01KG2TP9MA26GMS73H3R2KPN3R), which is itself derived from the plain-text file [tom_sawyer.txt](arke:01KG2T4RHC4E1XKJ12BJRXE8E8). The extraction was performed by an automated system ("structure-extraction-lambda") and later reviewed manually.
## Contents
The table of contents lists all 35 chapters of the novel, from "CHAPTER I. Y-o-u-u Tom—Aunt Polly Decides Upon her Duty…" through "CHAPTER XXXV. A New Order of Things—Poor Huck—New Adventures Planned," followed by "CONCLUSION." Each entry includes a brief thematic subtitle. Additionally, the ToC includes a section titled "ILLUSTRATIONS" that lists over 100 captioned images from the original publication, such as "Tom Sawyer," "Huckleberry Finn," "The Graveyard," "McDougal’s Cave," and "The Escape from the Cave," offering insight into key scenes and characters depicted in the book. The content is split across two data chunks: [Chunk 1](arke:01KG2TSH1Y359BYVEEMXY85EK1) covers the chapters and initial illustrations, while [Chunk 2](arke:01KG2TSH2H7PJ5BS3WJN6KK1QS) completes the illustration list.
- description_generated_at
- 2026-01-28T17:38:32.903Z
- description_model
- Qwen/Qwen3-235B-A22B-Instruct-2507
- description_title
- Contents
- end_line
- 460
- extracted_at
- 2026-01-28T17:34:54.480Z
- extracted_by
- structure-extraction-lambda
- start_line
- 38
- text
- null
- title
- Contents