Enhanced content extraction and file type coverage for DLP on Windows devices
Microsoft Purview: Microsodt announced upcoming enhancements to Microsoft Purview Data Loss Prevention (DLP). With the forthcoming update, the capability to scan, classify, and protect sensitive content on Windows endpoint devices will be significantly expanded. The number of supported file types will increase from approximately 40 to over 100, aligning endpoint coverage with other platforms like Exchange, SharePoint, and OneDrive.
Additionally, this update will introduce several key enhancements, including:
- Detection of labels from protected files (pfiles).
- Identification of sensitive content within file metadata.
- Recognition of sensitive information in PDF form fields.
- Detection of sensitive information in files embedded inside office files (for example, a .txt file inside .pptx file)
When this will happen:
Public Preview: Microsoft will begin rolling out late June 2024 and expect to complete by early October 2024.
General Availability Worldwide: Microsoft will begin rolling out early October 2024 and expect to complete by late October 2024.
How this will affect your organization:
The upcoming update will enhance DLP’s content scanning on Windows devices. No changes to existing policies are required.
Summary of enhancements:
- Enhanced file type coverage
The file type coverage to scan, classify, and protect sensitive content on Windows Endpoint devices will increase from 40 file types to over 100.
This means that sensitive content in additional file types like BZ2, EML etc. will also start getting scanned and protected using DLP policies.
- Detect label in PFILE
The DLP condition “content contains sensitivity label” now has the capability to detect labels from protected files (pfiles). This means that it can now read labels not just from Office and PDF files, but all other files where MIP label with protection can be applied via applications like AIP client, Secude etc. which converts the file into “pfile”.
A txt file converted to .ptxt (PFile) after applying a label. This label can now be detected with this preview.
- Scanning metadata
Ability to detect sensitive content in file metadata like custom properties in Office and PDF files.
- Scanning content embedded in Microsoft 365 office files
If a file is embedded inside an office file (Microsoft Word/Excel/PowerPoint), the content of the embedded file is also scanned. For example, if a DOCX file containing credit card numbers is inserted into an XLSX file, the content of both XLSX and the embedded DOCX files will be scanned, and credit card numbers will be detected.
- Better scanning with PDF files
· Ability to scan and detect sensitive content in PDF forms.
· Ability to scan and detect sensitive content in permission protected PDF files. Permission protected PDF files are ones which do not require any password to open the file and read the content but require a password to edit/copy the content.
What you need to do to prepare:
You do not need do any changes to your existing policies. Your existing policies will seamlessly start scanning additional content as detailed above.
Additional Resources
No Comments