Metadata Management

Organize files with tags, properties, and classifications

Datar Drive provides robust metadata capabilities that help you categorize, organize, and find files efficiently. This guide explains how to leverage file metadata to improve organization and searchability.

Metadata Structure

Datar manages several types of metadata for files and folders, providing a comprehensive system for organization and information management.

metadata-example.json

Metadata Types

  • System Metadata: Automatically generated technical information
  • Tags: Key-value pairs for simple categorization
  • Custom Metadata: User-defined properties for specific needs
  • Extracted Metadata: Information parsed from file contents
  • Categorization: Formal classification of content

Tagging System

Tags provide a flexible way to categorize files and folders with simple key-value pairs.

Tag Structure

Understanding how tags work:

  • Key-value pairs (e.g., "department:finance")
  • Multiple tags per file or folder
  • Searchable and filterable
  • Inheritance options from parent folders
  • Controlled vocabularies (optional)
  • Tag colors and icons for visual organization

Common Use Cases

Popular tagging strategies:

  • Department or team identification
  • Project association
  • Content status (draft, review, final)
  • Content type (report, presentation, contract)
  • Time-based organization (quarter, year)
  • Priority or importance level

Tags

You can add up to 45 more tags.
Datar allows administrators to create standard tag sets that provide consistent categorization across the organization. Users can still add custom tags as needed.

Custom Metadata

Custom metadata provides a structured way to add specific properties to files and folders based on your organization's needs.

Creating Metadata Schemas

Metadata schemas define structured sets of properties for different file types:

  • Schema Definition: Administrators can create and manage metadata schemas
  • Field Types: Support for text, number, date, selection, user, and more
  • Validation Rules: Ensure data integrity with format and value constraints
  • Required Fields: Specify which metadata must be provided
  • Default Values: Pre-populate fields for faster data entry
  • Schema Assignment: Apply schemas to folders, file types, or categories
[Metadata Schema Editor Interface]

Content Extraction

Datar can automatically extract information from file contents to create additional metadata.

Automatic Extraction

Information extracted from files:

  • Document properties (title, subject, author)
  • Text content for full-text search
  • Images and visual content
  • Tables and structured data
  • Technical metadata (dimensions, duration, etc.)
  • Embedded metadata from specialized formats

Advanced Processing

Enhanced extraction capabilities:

  • Text recognition (OCR) for images and scans
  • Entity detection (people, organizations, dates)
  • Topic and keyword extraction
  • Language detection
  • Sentiment analysis
  • Custom extraction patterns

Content Extraction by File Type

File Format
Extractable Data
Office Documents (DOCX, XLSX, PPTX)
Title, author, company, keywords, comments, content, revision data
PDF
Author, title, subject, keywords, creation/modification dates, page count, content
Images (JPG, PNG, TIFF)
EXIF data (camera, date, location), dimensions, color profile, text via OCR
Audio/Video
Duration, bitrate, resolution, embedded metadata, transcription via speech recognition
Email (MSG, EML)
Sender, recipients, date, subject, body text, attachments
Content extraction happens automatically during upload but can also be triggered manually for existing files or when extraction capabilities are updated.

Content Classification

Classification provides a formal taxonomy for organizing content in a consistent, controlled way.

Classification System

How classification works:

  • Hierarchical categories and subcategories
  • Controlled vocabulary with defined terms
  • Multiple classification dimensions
  • Classification rules and policies
  • Required vs. optional classifications
  • Classification inheritance

Classification Benefits

Why use formal classification:

  • Consistent organization across the system
  • Improved compliance with records policies
  • Better governance and content management
  • Enhanced discoverability of information
  • Clear information architecture
  • Support for retention and disposition
Document Types
  • Financial Documents
    • Reports
    • Statements
    • Budgets
    • Invoices
    • Tax Documents
  • Human Resources
    • Policies
    • Employee Records
    • Benefits Information
    • Hiring Documents
    • Training Materials
  • Legal Documents
    • Contracts
    • Agreements
    • Compliance Documents
    • Regulatory Filings
    • Intellectual Property
  • Marketing Materials
    • Presentations
    • Brochures
    • Campaign Assets
    • Brand Guidelines
    • Market Research
Confidentiality Levels
  • Public
  • Internal Use Only
  • Confidential
  • Restricted
  • Top Secret
Departments
  • Executive
  • Finance
  • Human Resources
  • Information Technology
  • Legal
  • Marketing
  • Operations
  • Research & Development
  • Sales
[Classification Management Interface]

Metadata Integration

Datar's metadata system integrates with other modules and external systems to maximize usefulness.

Metadata Integration Points

Integration Point
Description
Key Capabilities
Search Engine
Powers comprehensive search across all metadata types
Full-text indexing, faceted search, relevance ranking, search suggestions
Workflow System
Uses metadata to drive business processes
Condition-based routing, approval flows, status tracking, notifications
Records Management
Leverages metadata for records classification
Retention schedules, legal holds, disposition rules, audit trails
Analytics Platform
Analyzes metadata for insights
Usage patterns, content analytics, trend detection, dashboards
External Systems
Shares metadata with other applications
Metadata import/export, syncing, API access, format conversion

Metadata Best Practices

  1. Develop a Metadata Strategy

    Create a comprehensive metadata strategy before implementation. Define what metadata is needed, how it will be used, who will maintain it, and how it aligns with business objectives.

  2. Use Consistent Naming Conventions

    Establish clear naming conventions for tags, properties, and classification terms. Consistency improves searchability and helps users understand how to apply metadata correctly.

  3. Balance Flexibility and Control

    Strike the right balance between controlled vocabularies and free-form metadata. Too much control can frustrate users, while too little leads to inconsistency and reduced findability.

  4. Automate Metadata Capture

    Leverage automation to reduce the burden on users. Use content extraction, default values, inheritance, and rules to populate metadata automatically whenever possible.

  5. Focus on Business Value

    Prioritize metadata that delivers business value. Each metadata field should serve a specific purpose like improving search, supporting workflows, or meeting compliance requirements.

  6. Train and Support Users

    Educate users on the importance of metadata and how to apply it correctly. Provide clear guidelines, templates, and ongoing support to encourage consistent metadata application.

Related Drive Components

  1. File Management

    Learn the basics of managing files and folders in the Drive module.

  2. Access Control

    Understand how to manage permissions and share files securely.

  3. File Versioning

    Track changes and manage file history over time.

Was this documentation helpful?