Creating a dataset using Now Assist Skill Kit

Australia Enable AI

Release

australia

ft:locale

ko-KR

ft:publication_title

Australia Enable AI

ft:clusterId

platai

bundleId

platai

workflow

Platform

Creating a dataset using Now Assist Skill Kit

릴리스 버전: Australia

업데이트 날짜 2026년 03월 12일

소요 시간: 3분

Use these guidelines to create an effective dataset. Having an effective dataset provides better results for your prompt.

Now Assist Skill Kit dataset creation overview

A data-driven approach to skill development relies on the collection of a high-quality dataset to develop and test the skill. When you use Now Assist Skill Kit, you can also leverage the existing capabilities of the ServiceNow AI Platform to create a high-quality dataset.

When collecting data for this purpose, you should aim to create datasets that are:

Representative of the skill’s intended deployment environment. The data should:
- Seek to reflect the expected distribution of inputs in the deployment environment.
- Capture variance along several identified axes, for example, input length, urgency.
- Include any examples of inputs that are known to be important to the use case.
- Consider edge cases (which may be rare) but that are suspected to cause problems, for example, long examples.
Sized appropriately for the team’s risk appetite.
- It’s possible to develop and deploy a skill with little data. However, a lack of data creates more uncertainty about how the skill performs in deployment.
- You should think like statisticians and produce confidence intervals for any associated performance scores and prompt comparisons.
Isolated from the data used for developing and writing the prompts.
- You should split the data collected into development and testing sets. By splitting the data, you are protecting some data solely for evaluation purposes.
- If you use all the data during the process of developing the prompt, your final evaluation of the skill is biased, meaning that it over-reports performance. This bias is because of a phenomenon known as prompt overfitting.