Issues with Data Discovery Jobs when running a Full Scan Type(for Privacy/Classification) Yokohama
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
3 hours ago
When creating a data discovery job with a scan type of "Full", we are running into 2 issues.
1. When running a scan on a single table, the scan is not covering all records on the table. It will run a small number, 3 or 4.
2. When running larger jobs with multiple tables, the jobs are getting hung up and are not completing, they just keep running. I have not been able to determine what the maximum table number is, if there is one, for these jobs.
Any assistance appreciated!
Thank you,
Heather
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
30m ago
Hi @Heather White ,
We’ve seen similar behavior with Data Discovery (Privacy/Classification) in Yokohama, and it usually comes down to platform limits + execution model, rather than the scan type itself
Best Practice Approach
1. Check and Tune System Limits
Review system properties related to Data Discovery / Classification:
Max records per table
Batch size
Worker threads
“Full” scan does not always bypass these limits unless tuned.
2. Avoid Large Multi-Table Jobs
Do not include too many tables in a single job
Recommended:
5–10 tables per job max
This significantly improves completion rate
3. Run Jobs in Controlled Batches
Schedule jobs in sequence, not parallel
Avoid overlap with:
Discovery
Imports
Other heavy background jobs
4. Validate Table Size & Performance
Large tables without proper indexing can slow down scans
Check execution details to confirm:
Records picked vs processed
5. Monitor Execution
Use:
System Diagnostics → Stats
Job execution logs
Look for long-running or stuck workers
If this helps, please mark it as Helpful and Accept as Solution.
Thanks!
