ServiceNow Research

Scaling up ML-based Black-box Planning with Partial STRIPS Models


A popular approach for sequential decision-making is to perform simulator-based search guided by Machine Learning (ML) methods such as policy learning. On the other hand, model-relaxation heuristics can guide the search effectively if a full declarative model is available. In this work, we consider how a practitioner can improve ML-based black-box planning in settings where a complete symbolic model is not available. We show that specifying an incomplete STRIPS model that describes only part of the problem enables the use of relaxation heuristics. Our findings on several planning domains suggest that this is an effective way to improve ML-based black-box planning beyond collecting more data or tuning ML architectures.
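
To make the idea of a relaxation heuristic over a partial model concrete, the following is a minimal sketch of the standard delete-relaxation heuristic h_max computed on a small STRIPS fragment. It is an illustration only, not the implementation used in this work: the `Action` type, the `h_max` function, and the toy key-and-door facts are all hypothetical names chosen for the example, and the partial model is assumed to cover just these facts while the rest of the problem stays in the black-box simulator.

```python
import math
from typing import FrozenSet, Iterable, NamedTuple


class Action(NamedTuple):
    """A STRIPS action restricted to the facts the partial model knows about."""
    name: str
    pre: FrozenSet[str]   # preconditions
    add: FrozenSet[str]   # add effects (delete effects are dropped by the relaxation)


def h_max(state: Iterable[str], goal: Iterable[str], actions: Iterable[Action]) -> float:
    """Delete-relaxation h_max with unit action costs.

    Returns math.inf when some goal fact is unreachable in the relaxed model.
    """
    actions = list(actions)
    cost = {fact: 0.0 for fact in state}
    changed = True
    while changed:  # iterate to a fixed point
        changed = False
        for action in actions:
            if all(p in cost for p in action.pre):
                reach = 1.0 + max((cost[p] for p in action.pre), default=0.0)
                for fact in action.add:
                    if reach < cost.get(fact, math.inf):
                        cost[fact] = reach
                        changed = True
    return max((cost.get(g, math.inf) for g in goal), default=0.0)


# Hypothetical partial model: only the key and the door are described symbolically.
ACTIONS = [
    Action("pick_key", frozenset(), frozenset({"have_key"})),
    Action("open_door", frozenset({"have_key"}), frozenset({"door_open"})),
]

print(h_max(set(), {"door_open"}, ACTIONS))          # estimate from the initial state
print(h_max({"have_key"}, {"door_open"}, ACTIONS))   # estimate once the key is held
```

In a black-box search, a value like this could be combined with a learned policy or value estimate to rank successor states. How the symbolic facts are read off simulator states is left open here and would depend on the domain.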

ICAPS'22 Workshop on Reliable Data-Driven Planning and Scheduling
Hector Palacios
Research Scientist

Research Scientist at Human Decision Support, based in Montreal, QC, Canada.