The content discovery gives you a complete picture of all files across your entire Microsoft 365 tenant — including all SharePoint sites and Teams document libraries, and OneDrive accounts. Run this discovery before migrating to gain full insight into your tenant content, storage footprint, and metadata landscape.
Content Governance scans the tenant and creates one job item per library in the Job list. Each job is created in the context of the source tenant, and is pre-configured for the subsequent migration action — the same job item that drives the discovery scan is later reused to drive the migration, so jobs do not need to be recreated when moving from discovery to migration. Each job item represents a single library or OneDrive. The following information is collected for every file:
|
File name, file type, folder path, created date, created by, last modified date, and last modified by. |
|
The total number of versions and the combined size of all versions — reflecting the true storage footprint of each file, not just the current version. |
|
Retention label and sensitivity label, enabling compliance-driven analysis and migration scoping directly in SMART BI. |
|
The SharePoint content type assigned to each file, available for filtering and grouping in all SMART BI reports. |
|
All individual custom metadata columns defined in each library are collected and stored — regardless of column type or naming — and made available for filtering and analysis in SMART BI. |
⏱ Scanning large tenants is time consuming. Due to the depth of data collection — including version history and all metadata for every file — scanning a large Microsoft 365 tenant can take anywhere from several hours to several days. Plan accordingly and consider running the scan outside business hours or over a weekend.
*Scan progress is tracked in real time in SMART BI — the Content Governance reporting solution built on Microsoft Power BI Desktop. Monitor all running scan jobs, files processed, volume collected, and performance rates (files, versions, and MB per hour) across your entire tenant directly in Power BI Desktop. You can also monitor progress on this page — see the Scan progress section below.
*Once the scan is complete, explore your data using the built-in SMART BI report visuals: Decomposition Tree — drill into volume or file counts by any dimension such as file type, library, modified by, or retention label. Matrix view — pivot data across libraries, file types, or metadata values. Slicers — filter by modified date, version count, volume, content type, retention label, sensitivity label, or any custom metadata value.
| Scan scope |
| Teams Libraries / SharePoint Libraries |
Content Governance discovers all SharePoint Document Libraries and Teams document libraries across the entire tenant. One scan job is created per library, so each library is scanned and reported on individually in SMART BI. |
| OneDrive |
OneDrive accounts are not auto-identified during the tenant scan. They are configured separately in the OneDrive dialog — see OneDrive configuration below. One scan job is created per mapped OneDrive, giving individual visibility into every user’s storage footprint, file activity, and metadata landscape. |
| OneDrive configuration |
|
After the tenant scan step, click Next to open the OneDrive configuration dialog. OneDrive accounts are configured here because each OneDrive must be paired with a target Nextcloud user and an app password before it can be discovered and migrated. |
|
Content Governance matches each Microsoft 365 user to its corresponding Nextcloud username automatically. Use this when M365 and Nextcloud identities follow a consistent naming convention across both systems. |
|
Map M365 users to Nextcloud users individually with manual intervention. Use this when identities differ between systems, or when you want to scope OneDrive discovery and migration to a specific subset of users. |
|
For each mapped user, supply the Nextcloud app password used to authenticate WebDAV access during the scan and the subsequent migration. |
| SharePoint metadata — automatic MetaVox column creation |
| Automatically create SharePoint custom columns in MetaVox app |
Applies only when the job is reconfigured from discovery to migration; this option has no effect during the discovery scan itself. When enabled, Content Governance inspects every SharePoint library and reads all custom columns defined there — including Choice, Text, Date, Numeric, and Managed Metadata column types. For each library, Content Governance automatically creates matching metadata columns in the MetaVox app in Nextcloud, maps and transfers all source values from SharePoint to MetaVox during migration, and creates a MetaVox view so the columns are immediately visible and usable in Nextcloud. This means your entire SharePoint metadata landscape is fully preserved in Nextcloud — with zero manual column configuration required. You can set the option now and it will be carried into the migration phase, or leave it for when you reconfigure the job. |
| SharePoint views — automatic MetaVox view creation |
| Automatically recreate SharePoint views in MetaVox app |
When automatic column creation is enabled, Content Governance additionally reads every view defined on each SharePoint library and recreates it as a corresponding view in the MetaVox app in Nextcloud. For each SharePoint view, Content Governance creates a matching MetaVox view with the same columns in the same order and applies the first sort column from the SharePoint view’s sort specification — preserving the view definition to the extent that MetaVox currently supports. Your users land in Nextcloud with the same list layouts they already know, with zero manual view configuration required. |
*View recreation maps what MetaVox supports today: column selection, column order, and the primary sort column. Additional SharePoint view features such as multi-column sort, filters, grouping, and column widths are not transferred, as MetaVox does not yet expose equivalent functionality.
| Job status |
| Ready |
Scan jobs are set to Ready immediately after creation and will be picked up and executed by Content Governance automatically. Choose this option to start collecting Business Intelligence data right away. |
| Hold |
Scan jobs are created but remain paused. No scanning takes place until you explicitly start them from the Job list. Choose this option if you want to review or adjust jobs before the scan begins. |
⚠ Permissions required — The scan requires an app registration with sufficient permissions to read all SharePoint sites and OneDrive accounts in the tenant. Without the correct permissions, the scan for Libraries or OneDrive accounts will return an error. Click here for a step-by-step guide