Automate repetitive tasks by auto-capturing steps in Agentic Desktop

  • Release version: Australia
  • Updated: March 12, 2026
  • Time required: 11 minutes
  • Create desktop actions by auto-capturing steps to automate repetitive tasks in Agentic Desktop. You can save the steps that you perform on application elements as a reusable desktop action of type UI block.

    Before you begin

    To access the Agentic Desktop functionality, perform the following steps:

    Confirm that the following system requirements are met:

    • The Windows 11 operating system is used.
    • The .NET 9.0 Runtime v9.0.10 and the .NET 9 Desktop Runtime v9.0.10 are installed.
    • No extended monitors are connected.
    • The same theme is used on the systems used for recording and for execution.
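The .NET requirement in the list above can be checked from a script. The following is a minimal sketch, not part of Agentic Desktop: it parses the text printed by the `dotnet --list-runtimes` command and reports which required runtimes are absent. The function names are hypothetical, and the exact-version match is an assumption, because the requirements do not state whether later 9.0.x patch versions are also accepted.

```python
import re

# Versions named in the system requirements. Exact matching is an assumption.
REQUIRED = {
    "Microsoft.NETCore.App": "9.0.10",         # .NET Runtime
    "Microsoft.WindowsDesktop.App": "9.0.10",  # .NET Desktop Runtime
}

# Each line of `dotnet --list-runtimes` has the form:
#   <runtime name> <version> [<install path>]
_LINE = re.compile(r"^(?P<name>\S+)\s+(?P<version>\S+)\s+\[.*\]$")


def installed_runtimes(listing: str) -> set[tuple[str, str]]:
    """Parse `dotnet --list-runtimes` output into (name, version) pairs."""
    found = set()
    for line in listing.splitlines():
        match = _LINE.match(line.strip())
        if match:
            found.add((match["name"], match["version"]))
    return found


def missing_runtimes(listing: str) -> list[str]:
    """Return the required runtimes that do not appear in the listing."""
    found = installed_runtimes(listing)
    return [
        f"{name} {version}"
        for name, version in REQUIRED.items()
        if (name, version) not in found
    ]
```

To use the sketch on a real system, pass it the standard output of the command, for example `subprocess.run(["dotnet", "--list-runtimes"], capture_output=True, text=True).stdout`. An empty result means both runtimes were found at the listed version.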

    Familiarize yourself with the Design workspace and Action recorder. For more information, see Agentic Desktop Design workspace.

    Role required: sn_aia.admin

    About this task

    You can record steps that you want to automate for desktop and web applications and save them as desktop actions of type UI block that AI agents can simulate while executing automations.

    With Action recorder, you can record every step you perform on desktop applications, including clicks, keystrokes, and data entry, along with visual and contextual information. This detailed recording of steps allows automated tasks to reliably replicate your interactions.

    Note:
    If your automation requires manual input, such as entering an OTP or solving a CAPTCHA, you must instruct the AI agent to wait for the user input during execution. Otherwise, the automation can't proceed.

    Procedure

    1. From your Windows system, launch the Agentic Desktop application.
    2. On the login page, in the Add ServiceNow URL field, enter the ServiceNow instance URL.
      For example, https://<instance name>.service-now.com.
      Figure: Agentic Desktop login screen for entering the ServiceNow instance URL.
    3. Select Proceed.
    4. Log in to your ServiceNow account by entering your user name and password.
      Your account must have the sn_aia.admin role.

      Figure: Login window for entering your ServiceNow account user name and password.

    5. On the Agentic Desktop home page, select Create desktop action.
      Figure: Agentic Desktop home page displaying the Create desktop action UI action, search and select options, and cards of existing desktop actions.
    6. In the New desktop action dialog box, select Auto-capture steps (recommended).
      Figure: New desktop action window with the auto-capture option selected and fields for entering a name and description for the desktop action.
    7. Enter a name and description for the desktop action.
    8. Select Start capturing.

      The Agentic Desktop window is minimized and the Action recorder panel is launched.

      Figure: Floating recorder panel that has the Discard, Pause, and Start recording UI actions.
    9. Open the applications that you want to record steps for.
    10. From the Action recorder panel, select Start recording.
      You’ll see a "Recording started" message on the Action recorder panel. You can select any of the following options when needed from the More options menu:
      • Pause: Temporarily stop recording steps.
      • Restart: Restart recording the steps.
      • Discard: Discard the recording if it doesn’t meet your needs.
    11. Perform the steps that you want to automate.

      Each step that you perform is captured sequentially, and the type of UI action is displayed for each step. For example, "Capturing Mouse Left Click event".

      Avoid performing steps too quickly while recording, because the recorder might miss some of them.
    12. After you’re done with all the steps, select End recording on the Action recorder panel.

      You’ll see the "Drafting recorded action(s)" message on the Action recorder panel.

      The recorded steps are displayed as screenshots in the Design workspace with anchors and steps automatically added.

    13. Review the sequence of captured screens and anchors.
      You can adjust the sequence by dragging the screens and anchors in the Screens and steps panel.
    14. Review the location of anchors and adjust as necessary.
      Note:
      The auto-anchoring accuracy depends on the application's accessibility, performance, and UI complexity.
    15. Review and adjust steps.
      You can modify the sequence by adding, removing, or dragging steps.
      1. From the Anchor control menu, select the Add step icon.
      2. From the contextual menu, select the type of step to add.
        Figure: Screen capture of an app with an anchor added, displaying the various types of input and output steps.
        Table 1. Description of the actions
        Goal | Step | Type | Example
        Enter text in a field | Set Text | Input | Enter any text data, such as a user name, an address, or a survey response, wherever text entry is accepted.
        Note:
        If you set a static value for this field, the automation uses it during execution and doesn’t prompt you for input from the Now Assist panel.
        Simulate a mouse click | Click | Input | Click a button, open a menu, or perform any step typically performed by a mouse click.
        Simulate an alternative mouse action (for example, right-click, drag, scroll, or paste) | Mouse Click | Input | Perform various mouse device actions, such as right-clicking to select an object or scrolling on a web page.
        Simulate a key press or a key function | Send Keys | Input | Perform keyboard shortcuts, such as copying text by pressing Ctrl + C on fields and elements.
        Note:
        If you set a static value for this field, the automation uses it during execution and doesn’t prompt you for input from the Now Assist panel.
        Capture text from a window or web page | Get Text | Output | Receive text from the source area.
        Capture a table | Get Table | Output | Receive a table from the source area when the text is in table format.
        Note:
        For the step to capture table data successfully, the data must already be in table form. The step can’t convert ordinary text to table data.
        Read text from an image | OCR Read Text | Output | Recognize text in images and return it in standard text format.

        You can add as many steps as your automation requires.

    16. Configure the properties for added screens, anchors, and steps.
    17. Provide names for all added screens, anchors, and steps.

      You can edit the auto-generated name of each element, but the name must follow these guidelines:

      • Name fields must not be empty.
      • Name fields must contain only alphanumeric characters. Spaces and special characters are not permitted.
      • Each name must be unique at its respective parent level.
        • Each screen must have a unique name at the desktop-action level.
        • Each anchor must have a unique name at the screen level.
        • Each step must have a unique name at the anchor level.
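The naming guidelines above can be expressed as a small check. The following is a minimal sketch under stated assumptions, not the validation that Agentic Desktop performs: the nested-list representation of a desktop action and all function names are hypothetical, and "alphanumeric" is assumed to mean ASCII letters and digits only.

```python
import re
from collections import Counter

# Assumption: "alphanumeric" means ASCII letters and digits only.
_ALNUM = re.compile(r"[A-Za-z0-9]+")


def name_problems(name: str) -> list[str]:
    """Check one name against the non-empty and alphanumeric-only rules."""
    if not name:
        return ["name is empty"]
    if not _ALNUM.fullmatch(name):
        return ["name contains characters other than letters and digits"]
    return []


def check_siblings(names: list[str], level: str, parent: str) -> list[str]:
    """Check every name at one level and flag names repeated under one parent."""
    problems = []
    for name, count in Counter(names).items():
        for issue in name_problems(name):
            problems.append(f"{level} {name!r} in {parent}: {issue}")
        if count > 1:
            problems.append(f"{level} name {name!r} is repeated in {parent}")
    return problems


def validate_action(screens: list) -> list[str]:
    """Validate a desktop action given as [(screen, [(anchor, [step, ...])])]."""
    problems = check_siblings([s for s, _ in screens], "screen", "desktop action")
    for screen, anchors in screens:
        problems += check_siblings(
            [a for a, _ in anchors], "anchor", f"screen {screen!r}"
        )
        for anchor, steps in anchors:
            problems += check_siblings(steps, "step", f"anchor {anchor!r}")
    return problems
```

Uniqueness is checked per parent, so two anchors named `Submit` are acceptable when they sit on different screens but not when they sit on the same screen. An empty result from `validate_action` means no rule was violated.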

    What to do next

    1. Configure the details of your desktop action. For more information, see Add details to desktop actions in Agentic Desktop.
    2. Test and activate the desktop action so that it can be added as a tool to AI agents. For more information, see Test and activate a desktop action in Agentic Desktop.
    3. Add the desktop action as a tool to AI agents in AI Agent Studio. For more information, see Add a desktop action to an AI agent.