Apache Kafka data input configuration fields
Summary of Apache Kafka data input configuration fields
The Apache Kafka data input configuration in ServiceNow enables customers to stream log data from Apache Kafka clusters into their instance efficiently and securely. This configuration is essential for integrating Kafka-based log sources, ensuring high availability through MID Server clusters, and managing authentication and data ingestion parameters.
Basic Configuration
- Name and Description: Specify a unique name and optional description for the data input.
- Execute on: Choose whether to use a specific MID Server or a MID Server cluster for log ingestion.
- MID Server Selection: When selecting a specific MID Server, only those supporting basic authentication are available. mTLS-enabled servers are excluded. The default capacity is 10 data inputs per MID Server, adjustable via MID Server properties.
- MID Server Cluster: Supports only failover clusters composed of MID Servers with basic authentication. The system handles failover by moving data inputs to another server if one fails. Log ingestion must be enabled on all cluster members; Health Log Analytics will enable it automatically if needed.
- Service Instance: Bind the data input to an existing service instance, which must be set to Operational. If none exists, create a new service instance and add configuration items (CIs) to it.
- Read-only Fields: Status, transport protocol (Apache Kafka), number of log sources created, time of the last log, any streaming error, and the time when the data input stopped or failed.
Query Settings
- From: The starting point in time for reading log data; logs older than this date and time are not ingested. This field is required. Setting it to a past date might force the system to read large amounts of data and cause congestion.
Transport Settings
- Kafka Node Names: List of Kafka cluster nodes in HOST:PORT format for connecting to the Kafka cluster; does not need to include all nodes.
- Topics: Comma-separated Kafka topics to subscribe to; required field.
- Kafka Credentials: Reference to the credentials that define the authentication protocol, such as SSL, SASL_SSL, or SASL_PLAINTEXT. Credentials are managed through a separate Kafka SSL credentials form.
- Group Id: Name of the Kafka Consumer Group used for consuming messages.
Advanced Configuration
- Timeouts: Poll timeout (default 500 ms) and node discovery timeout (default 30 ms) control Kafka connectivity behavior.
- Default Timezone: Used if log entries lack timezone information; defaults to GMT.
- Sub-sampling Ratios: Control how many events in each batch are discarded to reduce load; the default value of -1 disables this feature.
- Max Length in Bytes: The maximum event size, in bytes; defaults to 32766.
- Character Encoding: The character encoding of the log data; defaults to UTF-8.
- Drop if Queue is Full: Option to discard logs under MID Server load; default is false.
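The default timezone rule can be illustrated with a minimal Python sketch. It assumes ISO 8601 timestamps and uses UTC as a stand-in for GMT; the function name `apply_default_timezone` is hypothetical and is not part of the product.

```python
from datetime import datetime, timezone, tzinfo


def apply_default_timezone(raw: str, default_tz: tzinfo = timezone.utc) -> datetime:
    """Parse an ISO 8601 timestamp and attach default_tz only if no offset is present."""
    parsed = datetime.fromisoformat(raw)
    if parsed.tzinfo is None:
        # The log entry carries no timezone information, so fall back to the default.
        parsed = parsed.replace(tzinfo=default_tz)
    return parsed


# A timestamp without an offset receives the default; one with an offset keeps its own.
naive = apply_default_timezone("2024-05-01T12:00:00")
aware = apply_default_timezone("2024-05-01T12:00:00+02:00")
```

Only entries that lack timezone information are affected; entries that already carry an offset are left unchanged.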
Kafka SSL Credentials
The Kafka SSL credentials form allows customers to configure authentication methods for connecting to Kafka, choosing between SSL channel, SASL with SSL, and SASL without encryption. This ensures secure and compliant log ingestion.
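As a rough illustration, the three options correspond to the standard Apache Kafka client values for `security.protocol`. The option labels used as dictionary keys below are hypothetical; only the protocol values (`SSL`, `SASL_SSL`, `SASL_PLAINTEXT`) are standard Kafka names.

```python
# Hypothetical option labels mapped to standard Kafka `security.protocol` values.
SECURITY_PROTOCOLS = {
    "ssl": "SSL",                                # SSL channel
    "sasl_with_ssl": "SASL_SSL",                 # SASL authentication over SSL
    "sasl_without_encryption": "SASL_PLAINTEXT"  # SASL authentication, no encryption
}


def security_protocol(option: str) -> str:
    """Return the Kafka security.protocol value for a form option."""
    try:
        return SECURITY_PROTOCOLS[option]
    except KeyError:
        raise ValueError(f"unknown authentication option: {option!r}") from None


def is_encrypted(protocol: str) -> bool:
    """SASL_PLAINTEXT authenticates the client but does not encrypt traffic."""
    return protocol in ("SSL", "SASL_SSL")
```

Note that the SASL option without encryption authenticates the client but sends log data unencrypted, which matters when choosing a protocol for sensitive logs.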
Description of the fields on the Apache Kafka data input configuration form.
Basic configuration
| Field | Description |
|---|---|
| Name | Name of the new data input. This field is required. |
| Description | Description of the data input. |
| Execute on | Option to select whether to use a specific MID Server or a MID Server cluster. This field is required. |
| MID Server | (Only when the Execute on field is set to Specific MID Server.) The MID Server to which log data from Apache Kafka is pulled. This field is required. |
| MID Server Cluster | (Only when Execute on is set to Specific MID Server cluster.) The MID Server cluster to which the log data is pulled. This field is required. The data input runs on a single MID Server in the cluster until that MID Server fails. The system then moves all the data input tasks to the next available MID Server in the cluster according to the configured order. Note: For more information about MID Server clusters, see Configure a MID Server cluster. |
| Service instance | The service instance to which to bind the log data. This field is required. Note: If no relevant service instance exists, create a service instance and add CIs to it. Set the status of the new service instance to Operational. |
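The failover behavior described for MID Server clusters can be sketched as follows. This is a simplified model, not the product's implementation: the function name and server names are hypothetical, and whether a data input moves back once a failed MID Server recovers is not modeled.

```python
from typing import Optional, Sequence, Set


def pick_active_mid_server(ordered_cluster: Sequence[str], failed: Set[str]) -> Optional[str]:
    """Return the first MID Server, in configured order, that has not failed."""
    for name in ordered_cluster:
        if name not in failed:
            return name
    # Every cluster member has failed, so nothing can run the data input.
    return None


cluster = ["mid-a", "mid-b", "mid-c"]  # hypothetical names, in configured order
first_choice = pick_active_mid_server(cluster, set())
after_failure = pick_active_mid_server(cluster, {"mid-a"})
```

In this model the data input stays on one MID Server at a time, and all of its tasks move together when that server fails.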
Read-only fields
| Field | Description |
|---|---|
| Status | Status of the data input. |
| Transport | Protocol used to stream the log data. This data input uses Apache Kafka to stream log data to your instance. |
| Sources count | The number of log sources this data input has created. |
| Disabled since | The time when the data input stopped or failed. |
| Last log time | The time when the last log streamed in the data input. |
| Error message | The streaming error. This field is populated automatically. It displays only when a streaming error has occurred. |
Query settings
| Field | Description | Example |
|---|---|---|
| From | Starting date and time for reading the data. Data older than this date and time is not read. This field is required. Note: Setting this value to a past date might require the system to read large amounts of data, causing congestion. | Now -1 week |
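To make the From example concrete, the sketch below computes a cutoff one week before a given time and checks whether an event is recent enough to be read. The function names are hypothetical, and how the product evaluates a relative value such as Now -1 week is an assumption.

```python
from datetime import datetime, timedelta, timezone


def from_cutoff(now: datetime, weeks_back: int = 1) -> datetime:
    """Return the starting point that corresponds to 'Now -N weeks'."""
    return now - timedelta(weeks=weeks_back)


def should_read(event_time: datetime, cutoff: datetime) -> bool:
    """Data older than the cutoff is not read."""
    return event_time >= cutoff


now = datetime(2024, 5, 15, 12, 0, tzinfo=timezone.utc)  # fixed time for a repeatable example
cutoff = from_cutoff(now)
recent = should_read(datetime(2024, 5, 10, tzinfo=timezone.utc), cutoff)
stale = should_read(datetime(2024, 5, 1, tzinfo=timezone.utc), cutoff)
```

The further back the cutoff is set, the more historical data qualifies for reading, which is the source of the congestion risk noted above.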
Transport settings
| Field | Description | Example |
|---|---|---|
| Kafka node names | A comma-separated list in the format HOST:PORT,HOST:PORT. The list does not have to include all the Apache Kafka Cluster servers. | 123.4.5.6:9092,123.3.4.5:9093 |
| Topics | A comma-separated list of topics to which the data input must subscribe. This field is required. | FirstTopic,SecondTopic,ThirdTopic |
| Kafka credentials | Reference to the Apache Kafka credentials. You can display the Kafka SSL credentials form by selecting the Search icon. | None |
| Group Id | The name of the Apache Kafka Consumer Group. | logs |
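The formats in this table can be checked with a short validation sketch. The helper names are hypothetical, and the mapping of the form fields to the Kafka consumer properties `bootstrap.servers` and `group.id` is an assumption about how such values are typically passed to a Kafka client, not a description of the product's internals.

```python
from typing import Dict, List, Tuple


def parse_node_names(value: str) -> List[Tuple[str, int]]:
    """Split a HOST:PORT,HOST:PORT list into (host, port) pairs."""
    nodes = []
    for entry in value.split(","):
        host, sep, port = entry.strip().rpartition(":")
        if not sep or not host or not port.isdigit():
            raise ValueError(f"expected HOST:PORT, got {entry!r}")
        nodes.append((host, int(port)))
    return nodes


def parse_topics(value: str) -> List[str]:
    """Split a comma-separated topic list; at least one topic is required."""
    topics = [topic.strip() for topic in value.split(",") if topic.strip()]
    if not topics:
        raise ValueError("at least one topic is required")
    return topics


def consumer_settings(node_names: str, topics: str, group_id: str) -> Dict[str, object]:
    """Build an illustrative settings dictionary from the form values."""
    nodes = parse_node_names(node_names)
    return {
        "bootstrap.servers": ",".join(f"{host}:{port}" for host, port in nodes),
        "group.id": group_id,
        "topics": parse_topics(topics),  # illustrative key, not a Kafka property
    }


settings = consumer_settings(
    "123.4.5.6:9092, 123.3.4.5:9093",
    "FirstTopic,SecondTopic,ThirdTopic",
    "logs",
)
```

Because a Kafka client discovers the remaining brokers from the nodes it is given, the list does not need to name every server in the cluster.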
Advanced configuration
| Field | Description | Default value |
|---|---|---|
| Timeout | The time, in milliseconds, spent waiting in the poll if data is not available in the topics. | 500 |
| Node discovery timeout | The time, in milliseconds, before node discovery times out. | 30 |
| Default timezone | The default timezone if the log doesn't include timezone information. | GMT |
| Sub sample drop ratio | The number of events to batch together, out of which one will be discarded. This setting is used to reduce the number of fetched events. | -1 |
| Sub sample receive ratio | The number of events to batch together, out of which all but one will be discarded. This setting is used to decrease the number of received events. | -1 |
| Max length in bytes | The maximum length, in bytes, of events. | 32766 |
| Character encoding | The character encoding for this data input. | UTF-8 |
| Drop if queue is full | Option to discard logs if there is a load on the MID Server. | False |
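The two sub-sampling ratios are easy to confuse, so a minimal sketch of their described effect follows. It assumes that a value below 1 (such as the default of -1) disables sub-sampling, that the drop ratio discards the last event of each batch, and that the receive ratio keeps the first event of each batch. Which event in a batch is affected is an assumption; the function names are hypothetical.

```python
from typing import Iterable, Iterator, TypeVar

T = TypeVar("T")


def sub_sample_drop(events: Iterable[T], ratio: int) -> Iterator[T]:
    """Discard one event out of every `ratio` events; a ratio below 1 disables this."""
    for index, event in enumerate(events, start=1):
        if ratio >= 1 and index % ratio == 0:
            continue  # assumed: the last event of each batch is the one discarded
        yield event


def sub_sample_receive(events: Iterable[T], ratio: int) -> Iterator[T]:
    """Keep one event out of every `ratio` events; a ratio below 1 disables this."""
    for index, event in enumerate(events):
        if ratio < 1 or index % ratio == 0:
            yield event  # assumed: the first event of each batch is the one kept


events = list(range(1, 11))
dropped_one_in_five = list(sub_sample_drop(events, 5))
kept_one_in_five = list(sub_sample_receive(events, 5))
unchanged = list(sub_sample_drop(events, -1))
```

With a ratio of 5, the drop setting passes through four of every five events, while the receive setting passes through only one of every five.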