Configuration Example

This document walks through a simple configuration for SimpleDocExtractor.

Basic Configuration
Configuration Breakdown
Usage

Basic Configuration

Here’s a minimal configuration that processes TypeScript and JavaScript files:

import { TagExtractorPlugin } from "@/simple-docs-scraper/extractors/TagExtractorPlugin.js";
import { RecommendedFormatters } from "@/simple-docs-scraper/formatters/RecommendedFormatters.js";
import { SimpleDocExtractorConfig } from "@/simple-docs-scraper/types/config.js";
import path from "path";

const config: SimpleDocExtractorConfig = {
  baseDir: process.cwd(),
  targets: [
    {
      globOptions: {
        cwd: path.join(process.cwd(), "src"),
        extensions: "**/*.{js,ts}",
        ignore: ["**/tests/**", "**/scripts/**"],
      },
      outDir: path.join(process.cwd(), "docs"),
      createIndexFile: true,
      flatten: false, // Set to true to flatten nested directory structure
      extraction: [
        new TagExtractorPlugin({
          tag: "docs",
          searchAndReplace: "%content%",
        }),
      ],
    },
  ],
  formatters: RecommendedFormatters.recommended(),
};

Configuration Breakdown

Required Properties

baseDir: The base directory for your project (usually process.cwd())
targets: Array of target configurations defining what files to process

Target Configuration

Each target specifies:

globOptions: File matching rules
- cwd: Directory to search in
- extensions: Glob pattern selecting the files to process (e.g., "**/*.{js,ts}")
- ignore: Patterns to exclude (e.g., ["**/tests/**"])
outDir: Where to save generated documentation
createIndexFile: Whether to create an index.md file
extraction: How to extract documentation from source files
flatten: Whether to flatten the index structure (shows all nested files in a single list)

Extractor Plugins

The most common extractor is TagExtractorPlugin, which extracts content between HTML-like tags:

// In your source code:
/**
 * <docs>
 * This documentation will be extracted.
 * </docs>
 */

// In your config:
new TagExtractorPlugin({
  tag: "docs",
  searchAndReplace: "%content%",
})
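
To make the idea of "content between tags" concrete, here is a minimal sketch of how such an extraction could work. The extractTagContent helper is hypothetical and written only for illustration; it is not the plugin's actual implementation, which may handle nesting, attributes, or the searchAndReplace option differently.

```typescript
// Hypothetical helper (not part of the library): pulls out the text found
// between <tag> and </tag> markers in a source string.
function extractTagContent(source: string, tag: string): string[] {
  // Escape the tag name so it is safe to embed in a regular expression.
  const escaped = tag.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
  // [\s\S]*? matches lazily across line breaks, so each tag pair is matched separately.
  const pattern = new RegExp(`<${escaped}>([\\s\\S]*?)</${escaped}>`, "g");
  return Array.from(source.matchAll(pattern), (match) => match[1].trim());
}

const source = [
  "/**",
  " * <docs>",
  " * This documentation will be extracted.",
  " * </docs>",
  " */",
].join("\n");

const blocks = extractTagContent(source, "docs");
console.log(blocks);
```

Note that the extracted text still carries the comment asterisks at this point, which is why a formatting step follows extraction.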

Formatters

Formatters clean up the extracted content. Common ones include:

RemoveMultiLineCommentAsterisks: Removes the leading asterisks left over from multi-line comments
AddDoubleLinesFormatter: Adds spacing between content

Alternatively, use the recommended formatters, which include both of the above. The RecommendedFormatters class provides them via RecommendedFormatters.recommended().
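
As a rough illustration of what a formatter chain does, the sketch below applies two small string transforms in order. The Formatter type and the removeCommentAsterisks, addTrailingBlankLine, and applyFormatters names are all hypothetical stand-ins; the library's own formatter classes may have a different interface and different behavior.

```typescript
// Hypothetical stand-ins for illustration only; not the library's formatter API.
type Formatter = (content: string) => string;

// Strips a leading "*" comment prefix (and one following space) from every line.
const removeCommentAsterisks: Formatter = (content) =>
  content
    .split("\n")
    .map((line) => line.replace(/^\s*\*\s?/, ""))
    .join("\n");

// Assumed reading of "adds spacing": ends the content with one blank line.
const addTrailingBlankLine: Formatter = (content) => content.trimEnd() + "\n\n";

// Runs each formatter in order, feeding the output of one into the next.
function applyFormatters(content: string, formatters: Formatter[]): string {
  return formatters.reduce((acc, format) => format(acc), content);
}

const raw = " * First line.\n * Second line.";
const formatted = applyFormatters(raw, [removeCommentAsterisks, addTrailingBlankLine]);
console.log(JSON.stringify(formatted));
```

Order matters in a chain like this: the spacing step runs last so that it operates on the cleaned-up text.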

Flatten Feature

The flatten option controls how nested directory structures are displayed in index files:

flatten: false (default): Creates separate index files for each directory level
flatten: true: Shows all files and directories in a single flattened list with proper indentation

Example with flatten enabled:

{
  outDir: path.join(process.cwd(), "docs"),
  createIndexFile: true,
  flatten: true, // Flattens the directory structure
  extraction: [
    new TagExtractorPlugin({
      tag: "docs",
      searchAndReplace: "%content%",
    }),
  ],
}

This will create a single index file showing all nested files with proper indentation levels.
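
The flattened layout can be pictured as a depth-first walk over the output directory tree. The following is only a sketch under assumptions: the IndexEntry shape and the renderFlatIndex function are hypothetical, and the two-space indentation is chosen for the example rather than taken from the library.

```typescript
// Hypothetical data shape, not the library's internal representation.
interface IndexEntry {
  name: string;
  children?: IndexEntry[];
}

// Depth-first walk that emits one list line per entry,
// indenting two spaces for each nesting level.
function renderFlatIndex(entries: IndexEntry[], depth = 0): string[] {
  return entries.flatMap((entry) => [
    `${"  ".repeat(depth)}- ${entry.name}`,
    ...renderFlatIndex(entry.children ?? [], depth + 1),
  ]);
}

const tree: IndexEntry[] = [
  { name: "api", children: [{ name: "client.md" }, { name: "server.md" }] },
  { name: "README.md" },
];

const flatIndexLines = renderFlatIndex(tree);
console.log(flatIndexLines.join("\n"));
```

With flatten set to false, the same tree would instead be split so that each directory gets its own index file listing only its direct children.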

Usage

import { SimpleDocExtractor } from "@/simple-docs-scraper/index.js";

const extractor = new SimpleDocExtractor(config);
const result = await extractor.start();

For more advanced configuration options, see advanced-config.md.
