Metadata Management

Organize files with tags, properties, and classifications

Datar Drive provides robust metadata capabilities that help you categorize, organize, and find files efficiently. This guide explains how to leverage file metadata to improve organization and searchability.

Metadata Structure

Datar manages several types of metadata for files and folders, providing a comprehensive system for organization and information management.

metadata-example.json

Metadata Types

System Metadata: Automatically generated technical information
Tags: Key-value pairs for simple categorization
Custom Metadata: User-defined properties for specific needs
Extracted Metadata: Information parsed from file contents
Categorization: Formal classification of content

Tagging System

Tags provide a flexible way to categorize files and folders with simple key-value pairs.

Tag Structure

Understanding how tags work:

Key-value pairs (e.g., "department:finance")
Multiple tags per file or folder
Searchable and filterable
Inheritance options from parent folders
Controlled vocabularies (optional)
Tag colors and icons for visual organization

Common Use Cases

Popular tagging strategies:

Department or team identification
Project association
Content status (draft, review, final)
Content type (report, presentation, contract)
Time-based organization (quarter, year)
Priority or importance level

Custom Metadata

Custom metadata provides a structured way to add specific properties to files and folders based on your organization's needs.

Creating Metadata Schemas

Metadata schemas define structured sets of properties for different file types:

Schema Definition: Administrators can create and manage metadata schemas
Field Types: Support for text, number, date, selection, user, and more
Validation Rules: Ensure data integrity with format and value constraints
Required Fields: Specify which metadata must be provided
Default Values: Pre-populate fields for faster data entry
Schema Assignment: Apply schemas to folders, file types, or categories

[Metadata Schema Editor Interface]

Content Extraction

Datar can automatically extract information from file contents to create additional metadata.

Automatic Extraction

Information extracted from files:

Document properties (title, subject, author)
Text content for full-text search
Images and visual content
Tables and structured data
Technical metadata (dimensions, duration, etc.)
Embedded metadata from specialized formats

Advanced Processing

Enhanced extraction capabilities:

Text recognition (OCR) for images and scans
Entity detection (people, organizations, dates)
Topic and keyword extraction
Language detection
Sentiment analysis
Custom extraction patterns

Content Extraction by File Type

File Format	Extractable Data
Office Documents (DOCX, XLSX, PPTX)	Title, author, company, keywords, comments, content, revision data
PDF	Author, title, subject, keywords, creation/modification dates, page count, content
Images (JPG, PNG, TIFF)	EXIF data (camera, date, location), dimensions, color profile, text via OCR
Audio/Video	Duration, bitrate, resolution, embedded metadata, transcription via speech recognition
Email (MSG, EML)	Sender, recipients, date, subject, body text, attachments

Content extraction happens automatically during upload but can also be triggered manually for existing files or when extraction capabilities are updated.

Content Classification

Classification provides a formal taxonomy for organizing content in a consistent, controlled way.

Classification System

How classification works:

Hierarchical categories and subcategories
Controlled vocabulary with defined terms
Multiple classification dimensions
Classification rules and policies
Required vs. optional classifications
Classification inheritance

Classification Benefits

Why use formal classification:

Consistent organization across the system
Improved compliance with records policies
Better governance and content management
Enhanced discoverability of information
Clear information architecture
Support for retention and disposition

Classification Example Taxonomy

Document Types

Financial Documents
- Reports
- Statements
- Budgets
- Invoices
- Tax Documents
Human Resources
- Policies
- Employee Records
- Benefits Information
- Hiring Documents
- Training Materials
Legal Documents
- Contracts
- Agreements
- Compliance Documents
- Regulatory Filings
- Intellectual Property
Marketing Materials
- Presentations
- Brochures
- Campaign Assets
- Brand Guidelines
- Market Research

Confidentiality Levels

Public
Internal Use Only
Confidential
Restricted
Top Secret

Departments

Executive
Finance
Human Resources
Information Technology
Legal
Marketing
Operations
Research & Development
Sales

[Classification Management Interface]

Metadata Integration

Datar's metadata system integrates with other modules and external systems to maximize usefulness.

Metadata Integration Points

Integration Point	Description	Key Capabilities
Search Engine	Powers comprehensive search across all metadata types	Full-text indexing, faceted search, relevance ranking, search suggestions
Workflow System	Uses metadata to drive business processes	Condition-based routing, approval flows, status tracking, notifications
Records Management	Leverages metadata for records classification	Retention schedules, legal holds, disposition rules, audit trails
Analytics Platform	Analyzes metadata for insights	Usage patterns, content analytics, trend detection, dashboards
External Systems	Shares metadata with other applications	Metadata import/export, syncing, API access, format conversion

Metadata Best Practices

Develop a Metadata Strategy
Create a comprehensive metadata strategy before implementation. Define what metadata is needed, how it will be used, who will maintain it, and how it aligns with business objectives.
Use Consistent Naming Conventions
Establish clear naming conventions for tags, properties, and classification terms. Consistency improves searchability and helps users understand how to apply metadata correctly.
Balance Flexibility and Control
Strike the right balance between controlled vocabularies and free-form metadata. Too much control can frustrate users, while too little leads to inconsistency and reduced findability.
Automate Metadata Capture
Leverage automation to reduce the burden on users. Use content extraction, default values, inheritance, and rules to populate metadata automatically whenever possible.
Focus on Business Value
Prioritize metadata that delivers business value. Each metadata field should serve a specific purpose like improving search, supporting workflows, or meeting compliance requirements.
Train and Support Users
Educate users on the importance of metadata and how to apply it correctly. Provide clear guidelines, templates, and ongoing support to encourage consistent metadata application.

Related Drive Components

File Management
Learn the basics of managing files and folders in the Drive module.
Learn more →
Access Control
Understand how to manage permissions and share files securely.
Learn more →
File Versioning
Track changes and manage file history over time.
Learn more →

Datar Drive

Advanced Features

Integration

Metadata Management

Metadata Structure

Metadata Types

Tagging System

Tag Structure

Common Use Cases

Tags

Custom Metadata

Creating Metadata Schemas

Content Extraction

Automatic Extraction

Advanced Processing

Content Extraction by File Type

Content Classification

Classification System

Classification Benefits

Metadata Integration

Metadata Integration Points

Metadata Best Practices

Develop a Metadata Strategy

Use Consistent Naming Conventions

Balance Flexibility and Control

Automate Metadata Capture

Focus on Business Value

Train and Support Users

Related Drive Components

File Management

Access Control

File Versioning

Was this documentation helpful?