Automate repetitive tasks by auto-capturing steps in AI Desktop Actions

  • Release version: Zurich
  • Updated May 22, 2026
  • Create desktop actions by auto-capturing steps to automate repetitive tasks in AI Desktop Actions. You can save the steps that you perform on the application elements as a reusable desktop action of type on-screen task.

    Before you begin

    To access the AI Desktop Actions functionality, perform the following steps:

    Confirm that the following system requirements are met:

    • Windows 11 operating system is used.
    • The .NET 9.0 Runtime v9.0.10 and the .NET 9 Desktop Runtime v9.0.10 are installed.
    • No extended monitors are connected.
    • The same theme is used on the system used for recording and the system used for execution.
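
    One way to check the .NET requirement is to inspect the output of the dotnet --list-runtimes command. The following is a minimal sketch, not part of the product. It assumes that the two runtimes appear under the shared-framework names Microsoft.NETCore.App and Microsoft.WindowsDesktop.App, and that exactly version 9.0.10 is required. The function names are illustrative.

```python
import re
import subprocess

# Assumption: the requirement is for exactly 9.0.10 of both runtimes, listed
# under these framework names by `dotnet --list-runtimes`.
REQUIRED_RUNTIMES = {
    "Microsoft.NETCore.App": "9.0.10",         # .NET Runtime
    "Microsoft.WindowsDesktop.App": "9.0.10",  # .NET Desktop Runtime
}


def parse_runtimes(listing: str) -> set[tuple[str, str]]:
    """Return (name, version) pairs found in `dotnet --list-runtimes` output."""
    pairs = set()
    for line in listing.splitlines():
        # Each line has the form: "<name> <version> [<install path>]"
        match = re.match(r"^(\S+)\s+(\S+)\s+\[", line.strip())
        if match:
            pairs.add((match.group(1), match.group(2)))
    return pairs


def missing_runtimes(listing: str) -> list[str]:
    """List the required runtimes that do not appear in the listing."""
    installed = parse_runtimes(listing)
    return sorted(
        f"{name} {version}"
        for name, version in REQUIRED_RUNTIMES.items()
        if (name, version) not in installed
    )


def read_installed_runtimes() -> str:
    """Run the dotnet CLI; return an empty string if it is not on PATH."""
    try:
        result = subprocess.run(
            ["dotnet", "--list-runtimes"],
            capture_output=True, text=True, check=False,
        )
    except FileNotFoundError:
        return ""
    return result.stdout


if __name__ == "__main__":
    missing = missing_runtimes(read_installed_runtimes())
    print("All required runtimes found." if not missing else f"Missing: {missing}")
```

    The parsing is kept separate from the command call so that it can be checked against a saved listing from another machine, such as the system used for execution.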

    Familiarize yourself with the Design workspace and Action recorder. For more information, see AI Desktop Actions Design workspace and Action recorder in AI Desktop Actions.

    Role required: sn_aia.admin

    About this task

    Note:
    To create desktop actions with AI-assisted anchor validation and automatic screen context generation, use Record with AI instead.

    You can record steps that you want to automate for desktop and web applications and save them as desktop actions of type on-screen task that AI agents can simulate while executing automations.

    With Action recorder, you can record every step you perform on desktop applications, including clicks, keystrokes, and data entry, along with visual and contextual information. This detailed recording of steps allows automated tasks to reliably replicate your interactions.

    Note:
    If your automation requires manual inputs, such as entering an OTP or CAPTCHA, you must instruct the AI agent to wait for the user input during execution. Otherwise, the automation can't proceed.

    Procedure

    1. From your Windows system, launch the AI Desktop Actions application.
    2. On the login page, in the Add ServiceNow URL field, enter the ServiceNow instance URL.
      For example, https://<instance name>.service-now.com.
      AI Desktop Actions login screen for entering ServiceNow instance URL.
    3. Select Proceed.
    4. Log in to your ServiceNow account by entering your user name and password.
      You must have the sn_aia.admin role.

      Login window for entering your ServiceNow account username and password.

    5. Optional: On the onboarding journey modal, complete the onboarding and select Get started.
      Onboarding journey widget with five pages to show you the highlights of the application.

      When you launch AI Desktop Actions for the first time, the onboarding journey widget appears. You can select Don't show me again to hide the widget the next time you launch AI Desktop Actions, or select Skip intro to skip the onboarding.

    6. On the AI Desktop Actions home page, select Create desktop action.
      AI Desktop Actions home page displaying the Create desktop action UI action, search and select options, and cards of existing desktop actions.
    7. In the Create Desktop Action dialog, clear the Record with AI (recommended) check box.
      Create desktop action modal with Record with AI option cleared and fields to enter name and description for the desktop action.
    8. Enter a name and description for the desktop action.
    9. Select Continue.
    10. In the modal, review the tips and select Open recorder to begin.
      Guided modal providing tips for effective recording.

      The AI Desktop Actions window is minimized and the Action recorder panel is launched. You can freely drag and reposition the Action recorder panel anywhere on your desktop screen.

      Floating recorder panel that has Discard, Pause, and Start recording UI actions.
    11. Open the applications that you want to record steps for.
    12. From the Action recorder panel, select Start recording.
      Important:
      Before you start recording, review the tips for accurate capturing of anchors and steps. For more information, see Tips for accurate recording.
      You will see a "Recording started" message on the Action recorder panel. You can select any of the following options when needed from the More options menu:
      • Pause: Temporarily stop recording steps
      • Restart: Start the recording over

        You will lose the screens and steps recorded so far.

      • Discard: Discard the recording if it doesn't meet your needs
    13. Perform the steps that you want to automate.

      Each step that you perform is captured sequentially and the type of UI action is displayed for each step. For example, Capturing Mouse Left Click event.

      You can capture a maximum of 50 steps in a recording session. While you auto-capture steps, a counter tracks the step count against this limit (for example, "35 of 50 max"). Recording stops automatically after you capture 50 steps. If you must capture additional steps, start a new recording session. The new recording adds screens and steps to those captured in the previous recording.

    14. After you're done with all the steps, select End recording on the Action recorder panel.

      You will see the "Draft workflow saved" message on the Action recorder panel.

      The recorded steps are displayed as screenshots in the Design workspace with anchors and steps automatically added.

    15. Review the sequence of captured screens and adjust if necessary.
      You can adjust the sequence by dragging the screens in the Screens and steps panel.
    16. Review the location of anchors and adjust as necessary.
      Note:
      If two applications in the frame have similar logos or visual elements, verify that the anchor position is unique to the target application to avoid incorrect element identification during automation.

      The auto-anchoring accuracy depends on the application's accessibility, performance, and UI complexity.

    17. Review and adjust steps.
      You can modify the sequence by adding, removing, or dragging steps.
      1. From the Anchor control menu, select the Add step icon .
      2. Select the type of step to add from the contextual menu.
        Screen capture of an app with anchor added, displaying various types of input and output steps.
        Table 1. Description of the actions
        Goal | Step | Type | Example
        Enter text in a field | Set Text | Input | Enter any text data such as a user name, an address, a survey response, or in any situation where text entry is accepted. Note: If you set a static value for this field, the automation uses it during execution and doesn’t prompt you for input from the Now Assist panel.
        Simulate a mouse click | Click | Input | Click a button, open a menu, or perform any step typically performed by a mouse click.
        Simulate an alternative mouse action (for example, right-click, drag, scroll, or paste) | Mouse Click | Input | Perform various mouse device actions, such as right-click and select an object or scroll on a web page.
        Simulate a key press or a key function | Send Keys | Input | Perform keyboard shortcuts, such as copying text by entering Ctrl + C on fields and elements. Note: If you set a static value for this field, the automation uses it during execution and doesn’t prompt you for input from the Now Assist panel.
        Capture text from a window or web page | Get Text | Output | Receive text from the source area.
        Capture a table | Get Table | Output | Receive table from the source area when the text is in the table format. Note: For the step to capture table data successfully, the data must already be in the table form. The step can’t convert ordinary text to table data.
        Read text from an image | OCR Read Text | Output | Recognize text from images and return it in the standard text format.

        You can add multiple steps to represent your automation.

    18. Configure the properties for added screens, anchors, and steps in the Properties panel.
    19. Optional: Modify the auto-generated names for all added screens, anchors, and steps.

      You can modify the auto-generated names by following these naming guidelines:

      • Name fields must not be empty.
      • Name fields must contain only alphanumeric characters. Spaces and special characters are not permitted.
      • Each name must be unique at its respective parent level.
        • Each screen must have a unique name at the desktop-action level.
        • Each anchor must have a unique name at the screen level.
        • Each step must have a unique name at the anchor level.
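
    The naming guidelines can be expressed as a small check that you might run on a list of planned names before entering them. This is an illustrative sketch only, not the validation that the product performs. It assumes that "alphanumeric" means ASCII letters and digits and that uniqueness is case-sensitive. The function name validate_names is hypothetical.

```python
import re

# Assumption: "alphanumeric" means ASCII letters and digits only.
ALNUM = re.compile(r"[A-Za-z0-9]+")


def validate_names(names: list[str]) -> list[str]:
    """Check sibling names (all under one parent) against the guidelines.

    Pass the screen names of one desktop action, the anchor names of one
    screen, or the step names of one anchor. Returns a list of problems;
    an empty list means every name passed.
    """
    problems = []
    seen = set()
    for name in names:
        if name == "":
            problems.append("name must not be empty")
        elif not ALNUM.fullmatch(name):
            problems.append(f"{name!r}: only alphanumeric characters are allowed")
        elif name in seen:
            # Assumption: uniqueness is compared case-sensitively.
            problems.append(f"{name!r}: duplicate at this level")
        seen.add(name)
    return problems


if __name__ == "__main__":
    print(validate_names(["LoginScreen", "Login Screen", "LoginScreen"]))
```

    Because uniqueness applies per parent, call the function once for each group of siblings. The same step name under two different anchors is not flagged.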

    What to do next

    1. Configure the details of your desktop action. For more information, see Add details to desktop actions in AI Desktop Actions.
    2. Test and activate the desktop action so that it can be added as a tool to AI agents. For more information, see Test and activate a desktop action in AI Desktop Actions.
    3. Add the desktop action as a tool to AI agents in AI Agent Studio. For more information, see Add a defined desktop action tool to an AI agent for desktop and web-based task.