Integrations
SharePoint

SharePoint Integration

No sections found for this integration
The integration documentation may not have the expected structure

Overview

The SharePoint integration in Lamatic automates document syncing and processing from Microsoft SharePoint sites. It supports various file types and provides secure integration with Lamatic Flow for automated document intelligence and RAG workflows.

This integration connects to your organization's SharePoint instance to sync documents for processing in Lamatic Flow.

Features

✅ Key Functionalities

  • Document Syncing: Automatically syncs documents from SharePoint sites and folders
  • File Type Support: Handles PDFs, Word documents, Excel files, and other compatible formats
  • Scheduled Processing: Supports automated sync schedules with cron expressions
  • Selective Filtering: Use glob patterns to filter specific file types and paths

✅ Benefits

  • Automates document collection from SharePoint repositories
  • Enables RAG workflows with organizational knowledge
  • Provides granular control over file selection and processing
  • Supports both incremental and full-refresh synchronization modes

Available Functionality

Event Triggers

✅ Scheduled document syncing from SharePoint sites
✅ Support for multiple file types (PDF, DOCX, XLSX, etc.)
✅ Folder-specific monitoring and filtering
✅ Incremental and full-refresh sync modes

Actions

✅ Parse and extract text from documents
✅ Vectorize content for RAG workflows
✅ Filter files using glob patterns
✅ Schedule automated sync operations

Prerequisites

Before setting up the SharePoint integration, ensure you have:

  • A Microsoft 365 account with SharePoint access
  • Appropriate permissions to access SharePoint sites and folders
  • Your organization's Tenant ID from Microsoft Entra Admin Center
  • Understanding of SharePoint site structure and file organization

Setup

Step 1: Set Up Microsoft 365 Credentials

  1. Get Tenant ID: Navigate to Microsoft Entra Admin Center (opens in a new tab)
  2. Access Azure Active Directory: Go to Azure Active Directory section
  3. Copy Tenant ID: Under Tenant Information, copy the Tenant ID (also called Directory ID)
⚠️

Ensure you have appropriate SharePoint access permissions for the sites you want to sync.

Step 2: Configure SharePoint Node

  1. Add SharePoint Node: Drag the SharePoint node to your flow
  2. Enter Credentials: Provide your Microsoft 365 Tenant ID
  3. Configure Site URL: Enter the SharePoint site URL you want to sync from
  4. Set Folder Path: Specify the folder path within the site (use "." for all folders)

Step 3: Test and Deploy

  1. Test Connection: Verify the node can access your SharePoint site
  2. Configure Sync Settings: Set up sync mode, schedule, and file filters
  3. Deploy Flow: Activate the flow to start syncing documents

Configuration Reference

SharePoint Node Parameters

ParameterDescriptionRequiredDefaultExample
Site URLURL of the SharePoint site to search for files-https://lamatic.sharepoint.com/sites/test
Folder PathPath to a specific folder within the drive. Use "." to search all folders-./TeamDocs/HR
Globs (Path Patterns)Pattern to filter specific files based on extension or name****/*.pdf, **/*.docx
Sync ModeSync behavior: full_refresh or incrementalincrementalincremental
Sync ScheduleCron expression for scheduled syncs-0 0 * * * (daily at midnight)
Search ScopeLocation to search: ACCESSIBLE_DRIVES, SHARED_ITEMS, or ALLALLALL
Parsing StrategyDocument parsing method: fast, ocr_only, or hi_resfasthi_res
Days To Sync If History Is FullLimit sync to files modified in the last N days if sync state is full3030
Start DateIgnore files modified before this UTC datetime (ISO format)-2017-01-25T00:00:00.000000Z

Sync Configuration Options

Sync Modes

# Incremental Sync (recommended)
sync_mode: "incremental"  # Only sync new/modified files
 
# Full Refresh
sync_mode: "full_refresh"  # Re-index all files

Schedule Examples

# Daily at midnight
sync_schedule: "0 0 * * *"
 
# Every 6 hours
sync_schedule: "0 */6 * * *"
 
# Weekdays only at 9 AM
sync_schedule: "0 9 * * 1-5"

File Filtering Patterns

Common Glob Patterns

# All PDF files
globs: "**/*.pdf"
 
# All Word and Excel files
globs: "**/*.docx", "**/*.xlsx"
 
# Files in specific folders
globs: "HR/**/*", "Legal/**/*"
 
# Exclude draft folders
globs: "**/*", "!**/draft/**"

Search Scope Options

  • ACCESSIBLE_DRIVES: Only files in drives you have direct access to
  • SHARED_ITEMS: Files shared with you by others
  • ALL: All accessible files (recommended)

Usage Examples

Basic SharePoint Sync

# Basic configuration for syncing all documents
site_url: "https://company.sharepoint.com/sites/documents"
folder_path: "."
globs: "**/*.pdf", "**/*.docx"
sync_mode: "incremental"
search_scope: "ALL"
parsing_strategy: "fast"

Advanced Configuration

# Advanced setup with scheduling and filtering
site_url: "https://company.sharepoint.com/sites/knowledge"
folder_path: "./TeamDocs"
globs: "**/*.pdf", "**/*.docx", "!**/draft/**"
sync_mode: "incremental"
sync_schedule: "0 2 * * *"  # Daily at 2 AM
search_scope: "ALL"
parsing_strategy: "hi_res"
days_to_sync_if_history_full: 30
start_date: "2024-01-01T00:00:00.000000Z"

Selective Document Sync

# Sync only specific document types from HR folder
site_url: "https://company.sharepoint.com/sites/hr"
folder_path: "./Policies"
globs: "**/*.pdf", "**/*.docx"
sync_mode: "incremental"
search_scope: "ACCESSIBLE_DRIVES"
parsing_strategy: "ocr_only"  # Better for scanned documents

Troubleshooting

Common Issues

ProblemSolution
Authentication FailedVerify Tenant ID is correct and you have SharePoint access permissions
Site Not FoundCheck the Site URL format and ensure you have access to the specified site
Files Not SyncingVerify folder path exists and glob patterns are correctly formatted
Permission DeniedEnsure your Microsoft 365 account has appropriate SharePoint permissions
Sync Not ScheduledCheck cron expression format and ensure sync schedule is properly configured

Debugging Steps

  1. Verify Credentials: Test your Microsoft 365 credentials and Tenant ID
  2. Check Site Access: Ensure you can access the SharePoint site in your browser
  3. Validate Folder Path: Confirm the folder path exists and is accessible
  4. Test Glob Patterns: Verify file filtering patterns match your documents
  5. Check Sync Logs: Review Lamatic Flow logs for detailed error information

Best Practices

  • Use incremental sync mode for better performance
  • Implement specific glob patterns to avoid syncing unnecessary files
  • Schedule syncs during off-peak hours to minimize impact
  • Use hi_res parsing for scanned documents and images
  • Regularly monitor sync logs for any issues
  • Set appropriate days_to_sync_if_history_full to limit historical data

Example Use Cases

Document Intelligence Workflows

  • Company Policies: Sync HR policies and procedures for automated Q&A
  • Knowledge Bases: Index departmental wikis and team documentation
  • Compliance Documents: Process legal and audit-related content
  • Project Documentation: Automate access to project files and reports

RAG Applications

  • Semantic Search: Enable natural language search across organizational documents
  • Question Answering: Build AI assistants that can answer questions about company policies
  • Document Summarization: Automatically summarize lengthy reports and documents
  • Content Discovery: Help users find relevant information across SharePoint sites

Related Resources

Was this page useful?

Questions? We're here to help

Subscribe to updates