VRL function reference
Here you’ll find a comprehensive list of all built-in VRL functions. Functions are categorized by their purpose and sorted alphabetically for easy discovery. To use these functions in Vector, see the documentation on function call expressions and Vector’s remap transform.
Array functions
append
infallible pureitems array to the end of the value array.chunks
fallible purevalue into slices of length chunk_size bytes.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | array string | The array of bytes to split. | yes | |
| chunk_size | integer | The desired length of each chunk in bytes. This may be constrained by the host platform architecture. | yes |
Errors
Thechunks function is fallible, which means that
error handling is required for these errors:chunk_size must be at least 1 byte.chunk_size is too large.push
infallible pureitem to the end of the value array.Codec functions
decode_base16
fallible purevalue (a Base16 string) into its original string.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The Base16 data to decode. | yes |
Errors
Thedecode_base16 function is fallible, which means that
error handling is required for these errors:value isn’t a valid encoded Base16 string.decode_base64
fallible purevalue (a Base64 string) into its original string.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The Base64 data to decode. | yes | |
| charset | string | The character set to use when decoding the data. | standard | no |
Errors
Thedecode_base64 function is fallible, which means that
error handling is required for these errors:value isn’t a valid encoded Base64 string.decode_gzip
fallible purevalue (a Gzip string) into its original string.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The Gzip data to decode. | yes |
Errors
Thedecode_gzip function is fallible, which means that
error handling is required for these errors:value isn’t a valid encoded Gzip string.decode_mime_q
fallible purevalue with their original string.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string with encoded-words to decode. | yes |
Errors
Thedecode_mime_q function is fallible, which means that
error handling is required for these errors:value has invalid encoded encoded-word string.decode_percent
infallible purevalue like a URL.decode_punycode
fallible purevalue, such as an internationalized domain name (IDN). This function assumes that the value passed is meant to be used in IDN context and that it is either a domain name or a part of it.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to decode. | yes | |
| validate | boolean | If enabled, checks if the input string is a valid domain name. | true | no |
Errors
Thedecode_punycode function is fallible, which means that
error handling is required for these errors:value is not valid punycodedecode_snappy
fallible purevalue (a Snappy string) into its original string.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The Snappy data to decode. | yes |
Errors
Thedecode_snappy function is fallible, which means that
error handling is required for these errors:value isn’t a valid encoded Snappy string.decode_zlib
fallible purevalue (a Zlib string) into its original string.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The Zlib data to decode. | yes |
Errors
Thedecode_zlib function is fallible, which means that
error handling is required for these errors:value isn’t a valid encoded Zlib string.decode_zstd
fallible purevalue (a Zstandard string) into its original string.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The Zstandard data to decode. | yes |
Errors
Thedecode_zstd function is fallible, which means that
error handling is required for these errors:value isn’t a valid encoded Zstd string.encode_base16
infallible purevalue to Base16.encode_base64
infallible purevalue to Base64.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to encode. | yes | |
| padding | boolean | Whether the Base64 output is padded. | true | no |
| charset | string | The character set to use when encoding the data. | standard | no |
encode_gzip
infallible purevalue to Gzip.encode_json
infallible purevalue to JSON.encode_key_value
fallible purevalue into key-value format with customizable delimiters. Default delimiters match
the logfmt format.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | object | The value to convert to a string. | yes | |
| fields_ordering | array | The ordering of fields to preserve. Any fields not in this list are listed unordered, after all ordered fields. | no | |
| key_value_delimiter | string | The string that separates the key from the value. | = | no |
| field_delimiter | string | The string that separates each key-value pair. | | no |
| flatten_boolean | boolean | Whether to encode key-value with a boolean value as a standalone key if true and nothing if false. | no |
Notices
This function has special behavior that you should be aware of.fields_ordering is specified then the function is fallible else it is infallible.Errors
Theencode_key_value function is fallible, which means that
error handling is required for these errors:fields_ordering contains a non-string element.Examples
Encode with default delimiters (no ordering)
encode_key_value({"ts": "2021-06-05T17:20:00Z", "msg": "This is a message", "lvl": "info"})lvl=info msg="This is a message" ts=2021-06-05T17:20:00ZEncode with default delimiters (fields ordering)
encode_key_value!({"ts": "2021-06-05T17:20:00Z", "msg": "This is a message", "lvl": "info", "log_id": 12345}, ["ts", "lvl", "msg"])ts=2021-06-05T17:20:00Z lvl=info msg="This is a message" log_id=12345Encode with default delimiters (nested fields)
encode_key_value({"agent": {"name": "foo"}, "log": {"file": {"path": "my.log"}}, "event": "log"})agent.name=foo event=log log.file.path=my.logEncode with default delimiters (nested fields ordering)
encode_key_value!({"agent": {"name": "foo"}, "log": {"file": {"path": "my.log"}}, "event": "log"}, ["event", "log.file.path", "agent.name"])event=log log.file.path=my.log agent.name=fooEncode with custom delimiters (no ordering)
encode_key_value(
{"ts": "2021-06-05T17:20:00Z", "msg": "This is a message", "lvl": "info"},
field_delimiter: ",",
key_value_delimiter: ":"
)lvl:info,msg:"This is a message",ts:2021-06-05T17:20:00ZEncode with custom delimiters and flatten boolean
encode_key_value(
{"ts": "2021-06-05T17:20:00Z", "msg": "This is a message", "lvl": "info", "beta": true, "dropped": false},
field_delimiter: ",",
key_value_delimiter: ":",
flatten_boolean: true
)beta,lvl:info,msg:"This is a message",ts:2021-06-05T17:20:00Zencode_logfmt
fallible purevalue to logfmt.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | object | The value to convert to a logfmt string. | yes | |
| fields_ordering | array | The ordering of fields to preserve. Any fields not in this list are listed unordered, after all ordered fields. | no |
Notices
This function has special behavior that you should be aware of.fields_ordering is specified then the function is fallible else it is infallible.Errors
Theencode_logfmt function is fallible, which means that
error handling is required for these errors:fields_ordering contains a non-string element.Examples
Encode to logfmt (no ordering)
encode_logfmt({"ts": "2021-06-05T17:20:00Z", "msg": "This is a message", "lvl": "info"})lvl=info msg="This is a message" ts=2021-06-05T17:20:00ZEncode to logfmt (fields ordering)
encode_logfmt!({"ts": "2021-06-05T17:20:00Z", "msg": "This is a message", "lvl": "info", "log_id": 12345}, ["ts", "lvl", "msg"])ts=2021-06-05T17:20:00Z lvl=info msg="This is a message" log_id=12345encode_percent
infallible purevalue with percent encoding to safely be used in URLs.encode_proto
fallible purevalue into a protocol buffer payload.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | object | The object to convert to a protocol buffer payload. | yes | |
| desc_file | string | The path to the protobuf descriptor set file. Must be a literal string. This file is the output of protoc -o | yes | |
| message_type | string | The name of the message type to use for serializing. Must be a literal string. | yes |
Errors
Theencode_proto function is fallible, which means that
error handling is required for these errors:desc_file file does not exist.message_type message type does not exist in the descriptor file.encode_punycode
fallible purevalue to punycode. Useful for internationalized domain names (IDN). This function assumes that the value passed is meant to be used in IDN context and that it is either a domain name or a part of it.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to encode. | yes | |
| validate | boolean | Whether to validate the input string to check if it is a valid domain name. | true | no |
Errors
Theencode_punycode function is fallible, which means that
error handling is required for these errors:value can not be encoded to punycodeencode_snappy
fallible purevalue to Snappy.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to encode. | yes |
Errors
Theencode_snappy function is fallible, which means that
error handling is required for these errors:value cannot be encoded into a Snappy string.encode_zlib
infallible purevalue to Zlib.encode_zstd
infallible purevalue to Zstandard.Coerce functions
to_bool
fallible purevalue into a boolean.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | boolean integer float null string | The value to convert to a Boolean. | yes |
Errors
Theto_bool function is fallible, which means that
error handling is required for these errors:value is not a supported boolean representation.to_float
fallible purevalue into a float.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | integer float boolean string timestamp | The value to convert to a float. Must be convertible to a float, otherwise an error is raised. | yes |
Errors
Theto_float function is fallible, which means that
error handling is required for these errors:value is not a supported float representation.to_int
fallible purevalue into an integer.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | integer float boolean string timestamp null | The value to convert to an integer. | yes |
Errors
Theto_int function is fallible, which means that
error handling is required for these errors:value is a string but the text is not an integer.value is not a string, int, or timestamp.to_regex
fallible purevalue into a regex.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The value to convert to a regex. | yes |
Notices
This function has special behavior that you should be aware of.Errors
Theto_regex function is fallible, which means that
error handling is required for these errors:value is not a string.to_string
fallible purevalue into a string.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | integer float boolean string timestamp null | The value to convert to a string. | yes |
Errors
Theto_string function is fallible, which means that
error handling is required for these errors:value is not an integer, float, boolean, string, timestamp, or null.Convert functions
from_unix_timestamp
infallible pureConverts the value integer from a Unix timestamp to a VRL timestamp.
Converts from the number of seconds since the Unix epoch by default. To convert from milliseconds or nanoseconds, set the unit argument to milliseconds or nanoseconds.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | integer | The Unix timestamp to convert. | yes | |
| unit | string | The time unit. | seconds | no |
to_syslog_facility
fallible purevalue, a Syslog facility code, into its corresponding
Syslog keyword. For example, 0 into "kern", 1 into "user", etc.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | integer | The facility code. | yes |
Errors
Theto_syslog_facility function is fallible, which means that
error handling is required for these errors:value is not a valid Syslog facility code.to_syslog_level
fallible purevalue, a Syslog severity level, into its corresponding keyword,
i.e. 0 into "emerg", 1 into "alert", etc.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | integer | The severity level. | yes |
Errors
Theto_syslog_level function is fallible, which means that
error handling is required for these errors:value isn’t a valid Syslog severity level.to_syslog_severity
fallible pureFunction spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The Syslog level keyword to convert. | yes |
Errors
Theto_syslog_severity function is fallible, which means that
error handling is required for these errors:value is not a valid Syslog level keyword.to_unix_timestamp
infallible pureConverts the value timestamp into a Unix timestamp.
Returns the number of seconds since the Unix epoch by default. To return the number in milliseconds or nanoseconds, set the unit argument to milliseconds or nanoseconds.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | timestamp | The timestamp to convert into a Unix timestamp. | yes | |
| unit | string | The time unit. | seconds | no |
Debug functions
assert
fallible impurecondition, which must be a Boolean expression. The program is aborted with
message if the condition evaluates to false.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| condition | boolean | The condition to check. | yes | |
| message | string | An optional custom error message. If the equality assertion fails, message is
appended to the default message prefix. See the examples below
for a fully formed log message sample. | no |
Notices
This function has special behavior that you should be aware of.assert function should be used in a standalone fashion and only when you want to abort the program. You
should avoid it in logical expressions and other situations in which you want the program to continue if the
condition evaluates to false.Errors
Theassert function is fallible, which means that
error handling is required for these errors:condition evaluates to false.assert_eq
infallible impureleft and right, have the same value. The program is
aborted with message if they do not have the same value.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| left | any | The value to check for equality against right. | yes | |
| right | any | The value to check for equality against left. | yes | |
| message | string | An optional custom error message. If the equality assertion fails, message is
appended to the default message prefix. See the examples
below for a fully formed log message sample. | no |
Notices
This function has special behavior that you should be aware of.assert_eq function should be used in a standalone fashion and only when you want to
abort the program. You should avoid it in logical expressions and other situations in which
you want the program to continue if the condition evaluates to false.log
infallible impureFunction spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | any | The value to log. | yes | |
| level | string | The log level. | info | no |
| rate_limit_secs | integer | Specifies that the log message is output no more than once per the given number of seconds.
Use a value of 0 to turn rate limiting off. | 1 | no |
Enrichment functions
find_enrichment_table_records
infallible pureSearches an enrichment table for rows that match the provided condition.
For file enrichment tables, this condition needs to be a VRL object in which
the key-value pairs indicate a field to search mapped to a value to search in that field.
This function returns the rows that match the provided condition(s). All fields need to
match for rows to be returned; if any fields do not match, then no rows are returned.
There are currently two forms of search criteria:
Exact match search. The given field must match the value exactly. Case sensitivity can be specified using the
case_sensitiveargument. An exact match search can use an index directly into the dataset, which should make this search fairly “cheap” from a performance perspective.Date range search. The given field must be greater than or equal to the
fromdate and less than or equal to thetodate. A date range search involves sequentially scanning through the rows that have been located using any exact match criteria. This can be an expensive operation if there are many rows returned by any exact match criteria. Therefore, use date ranges as the only criteria when the enrichment data set is very small.
For geoip and mmdb enrichment tables, this condition needs to be a VRL object with a single key-value pair
whose value needs to be a valid IP address. Example: {"ip": .ip }. If a return field is expected
and without a value, null is used. This table can return the following fields:
ISP databases:
autonomous_system_numberautonomous_system_organizationisporganization
City databases:
city_namecontinent_codecountry_codecountry_nameregion_coderegion_namemetro_codelatitudelongitudepostal_codetimezone
Connection-Type databases:
connection_type
To use this function, you need to update your configuration to
include an
enrichment_tables
parameter.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| table | string | The enrichment table to search. | yes | |
| condition | object | The condition to search on. Since the condition is used at boot time to create indices into the data, these conditions must be statically defined. | yes | |
| select | array | A subset of fields from the enrichment table to return. If not specified, all fields are returned. | no | |
| case_sensitive | boolean | Whether text fields need to match cases exactly. | true | no |
get_enrichment_table_record
fallible pureSearches an enrichment table for a row that matches the provided condition. A single row must be matched. If no rows are found or more than one row is found, an error is returned.
For file enrichment tables, this condition needs to be a VRL object in which
the key-value pairs indicate a field to search mapped to a value to search in that field.
This function returns the rows that match the provided condition(s). All fields need to
match for rows to be returned; if any fields do not match, then no rows are returned.
There are currently two forms of search criteria:
Exact match search. The given field must match the value exactly. Case sensitivity can be specified using the
case_sensitiveargument. An exact match search can use an index directly into the dataset, which should make this search fairly “cheap” from a performance perspective.Date range search. The given field must be greater than or equal to the
fromdate and less than or equal to thetodate. A date range search involves sequentially scanning through the rows that have been located using any exact match criteria. This can be an expensive operation if there are many rows returned by any exact match criteria. Therefore, use date ranges as the only criteria when the enrichment data set is very small.
For geoip and mmdb enrichment tables, this condition needs to be a VRL object with a single key-value pair
whose value needs to be a valid IP address. Example: {"ip": .ip }. If a return field is expected
and without a value, null is used. This table can return the following fields:
ISP databases:
autonomous_system_numberautonomous_system_organizationisporganization
City databases:
city_namecontinent_codecountry_codecountry_nameregion_coderegion_namemetro_codelatitudelongitudepostal_codetimezone
Connection-Type databases:
connection_type
To use this function, you need to update your configuration to
include an
enrichment_tables
parameter.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| table | string | The enrichment table to search. | yes | |
| condition | object | The condition to search on. Since the condition is used at boot time to create indices into the data, these conditions must be statically defined. | yes | |
| select | array | A subset of fields from the enrichment table to return. If not specified, all fields are returned. | no | |
| case_sensitive | boolean | Whether the text fields match the case exactly. | true | no |
Errors
Theget_enrichment_table_record function is fallible, which means that
error handling is required for these errors:Enumerate functions
compact
infallible purevalue by removing empty values, where empty values are defined using the
available parameters.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | array object | The object or array to compact. | yes | |
| recursive | boolean | Whether the compaction be recursive. | true | no |
| null | boolean | Whether null should be treated as an empty value. | true | no |
| string | boolean | Whether an empty string should be treated as an empty value. | true | no |
| object | boolean | Whether an empty object should be treated as an empty value. | true | no |
| array | boolean | Whether an empty array should be treated as an empty value. | true | no |
| nullish | boolean | Tests whether the value is “nullish” as defined by the is_nullish function. | no |
filter
infallible pureFilter elements from a collection.
This function currently does not support recursive iteration.
The function uses the function closure syntax to allow reading the key-value or index-value combination for each item in the collection.
The same scoping rules apply to closure blocks as they do for regular blocks. This means that any variable defined in parent scopes is accessible, and mutations to those variables are preserved, but any new variables instantiated in the closure block are unavailable outside of the block.
See the examples below to learn about the closure syntax.
flatten
infallible purevalue into a single-level representation.for_each
infallible pureIterate over a collection.
This function currently does not support recursive iteration.
The function uses the “function closure syntax” to allow reading the key/value or index/value combination for each item in the collection.
The same scoping rules apply to closure blocks as they do for regular blocks. This means that any variable defined in parent scopes is accessible, and mutations to those variables are preserved, but any new variables instantiated in the closure block are unavailable outside of the block.
See the examples below to learn about the closure syntax.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | array object | The array or object to iterate. | yes |
Examples
Tally elements
tally = {}
for_each(array!(.tags)) -> |_index, value| {
# Get the current tally for the `value`, or
# set to `0`.
count = int(get!(tally, [value])) ?? 0
# Increment the tally for the value by `1`.
tally = set!(tally, [value], count + 1)
}
tally{
"bar": 1,
"baz": 1,
"foo": 2
}includes
infallible purevalue array includes the specified item.keys
infallible purelength
infallible pureReturns the length of the value.
- If
valueis an array, returns the number of elements. - If
valueis an object, returns the number of top-level keys. - If
valueis a string, returns the number of bytes in the string. If you want the number of characters, seestrlen.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | array object string | The array or object. | yes |
map_keys
infallible pureMap the keys within an object.
If recursive is enabled, the function iterates into nested
objects, using the following rules:
- Iteration starts at the root.
- For every nested object type:
- First return the key of the object type itself.
- Then recurse into the object, and loop back to item (1) in this list.
- Any mutation done on a nested object before recursing into it, are preserved.
- For every nested array type:
- First return the key of the array type itself.
- Then find all objects within the array, and apply item (2) to each individual object.
The above rules mean that map_keys with
recursive enabled finds all keys in the target,
regardless of whether nested objects are nested inside arrays.
The function uses the function closure syntax to allow reading the key for each item in the object.
The same scoping rules apply to closure blocks as they do for regular blocks. This means that any variable defined in parent scopes is accessible, and mutations to those variables are preserved, but any new variables instantiated in the closure block are unavailable outside of the block.
See the examples below to learn about the closure syntax.
map_values
infallible pureMap the values within a collection.
If recursive is enabled, the function iterates into nested
collections, using the following rules:
- Iteration starts at the root.
- For every nested collection type:
- First return the collection type itself.
- Then recurse into the collection, and loop back to item (1) in the list
- Any mutation done on a collection before recursing into it, are preserved.
The function uses the function closure syntax to allow mutating the value for each item in the collection.
The same scoping rules apply to closure blocks as they do for regular blocks, meaning, any variable defined in parent scopes are accessible, and mutations to those variables are preserved, but any new variables instantiated in the closure block are unavailable outside of the block.
Check out the examples below to learn about the closure syntax.
match_array
infallible purevalue array matches the pattern. By default, it checks that at least one element matches, but can be set to determine if all the elements match.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | array | The array. | yes | |
| pattern | regex | The regular expression pattern to match against. | yes | |
| all | boolean | Whether to match on all elements of value. | no |
strlen
infallible pureReturns the number of UTF-8 characters in value. This differs from
length which counts the number of bytes of a string.
Note: This is the count of Unicode scalar values which can sometimes differ from Unicode code points.
unflatten
infallible purevalue into a nested representation.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | object | The array or object to unflatten. | yes | |
| separator | string | The separator to split flattened keys. | . | no |
| recursive | boolean | Whether to recursively unflatten the object values. | true | no |
Examples
Unflatten
unflatten({
"foo.bar.baz": true,
"foo.bar.qux": false,
"foo.quux": 42
}){
"foo": {
"bar": {
"baz": true,
"qux": false
},
"quux": 42
}
}Unflatten recursively
unflatten({
"flattened.parent": {
"foo.bar": true,
"foo.baz": false
}
}){
"flattened": {
"parent": {
"foo": {
"bar": true,
"baz": false
}
}
}
}unique
infallible pureReturns the unique values for an array.
The first occurrence of each element is kept.
values
infallible pureEvent functions
get_secret
infallible pureremove_secret
infallible pureset_secret
infallible pureset_semantic_meaning
infallible purePath functions
del
infallible impureRemoves the field specified by the static path from the target.
For dynamic path deletion, see the remove function.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| path | path | The path of the field to delete. | yes | |
| compact | boolean | After deletion, if compact is true and there is an empty object or array left,
the empty object or array is also removed, cascading up to the root. This only
applies to the path being deleted, and any parent paths. | no |
exists
infallible pureChecks whether the path exists for the target.
This function distinguishes between a missing path
and a path with a null value. A regular path lookup,
such as .foo, cannot distinguish between the two cases
since it always returns null if the path doesn’t exist.
get
fallible pureDynamically get the value of a given path.
If you know the path you want to look up, use
static paths such as .foo.bar[1] to get the value of that
path. However, if you do not know the path names,
use the dynamic get function to get the requested
value.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | object array | The object or array to query. | yes | |
| path | array | An array of path segments to look for the value. | yes |
Errors
Theget function is fallible, which means that
error handling is required for these errors:path segment must be a string or an integer.remove
fallible pureDynamically remove the value for a given path.
If you know the path you want to remove, use
the del function and static paths such as del(.foo.bar[1])
to remove the value at that path. The del function returns the
deleted value, and is more performant than remove.
However, if you do not know the path names, use the dynamic
remove function to remove the value at the provided path.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | object array | The object or array to remove data from. | yes | |
| path | array | An array of path segments to remove the value from. | yes | |
| compact | boolean | After deletion, if compact is true, any empty objects or
arrays left are also removed. | no |
Errors
Theremove function is fallible, which means that
error handling is required for these errors:path segment must be a string or an integer.set
fallible pureDynamically insert data into the path of a given object or array.
If you know the path you want to assign a value to,
use static path assignments such as .foo.bar[1] = true for
improved performance and readability. However, if you do not
know the path names, use the dynamic set function to
insert the data into the object or array.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | object array | The object or array to insert data into. | yes | |
| path | array | An array of path segments to insert the value into. | yes | |
| data | any | The data to be inserted. | yes |
Errors
Theset function is fallible, which means that
error handling is required for these errors:path segment must be a string or an integer.Cryptography functions
decrypt
fallible pureDecrypts a string with a symmetric encryption algorithm.
Supported Algorithms:
- AES-256-CFB (key = 32 bytes, iv = 16 bytes)
- AES-192-CFB (key = 24 bytes, iv = 16 bytes)
- AES-128-CFB (key = 16 bytes, iv = 16 bytes)
- AES-256-OFB (key = 32 bytes, iv = 16 bytes)
- AES-192-OFB (key = 24 bytes, iv = 16 bytes)
- AES-128-OFB (key = 16 bytes, iv = 16 bytes)
- AES-128-SIV (key = 32 bytes, iv = 16 bytes)
- AES-256-SIV (key = 64 bytes, iv = 16 bytes)
- Deprecated - AES-256-CTR (key = 32 bytes, iv = 16 bytes)
- Deprecated - AES-192-CTR (key = 24 bytes, iv = 16 bytes)
- Deprecated - AES-128-CTR (key = 16 bytes, iv = 16 bytes)
- AES-256-CTR-LE (key = 32 bytes, iv = 16 bytes)
- AES-192-CTR-LE (key = 24 bytes, iv = 16 bytes)
- AES-128-CTR-LE (key = 16 bytes, iv = 16 bytes)
- AES-256-CTR-BE (key = 32 bytes, iv = 16 bytes)
- AES-192-CTR-BE (key = 24 bytes, iv = 16 bytes)
- AES-128-CTR-BE (key = 16 bytes, iv = 16 bytes)
- AES-256-CBC-PKCS7 (key = 32 bytes, iv = 16 bytes)
- AES-192-CBC-PKCS7 (key = 24 bytes, iv = 16 bytes)
- AES-128-CBC-PKCS7 (key = 16 bytes, iv = 16 bytes)
- AES-256-CBC-ANSIX923 (key = 32 bytes, iv = 16 bytes)
- AES-192-CBC-ANSIX923 (key = 24 bytes, iv = 16 bytes)
- AES-128-CBC-ANSIX923 (key = 16 bytes, iv = 16 bytes)
- AES-256-CBC-ISO7816 (key = 32 bytes, iv = 16 bytes)
- AES-192-CBC-ISO7816 (key = 24 bytes, iv = 16 bytes)
- AES-128-CBC-ISO7816 (key = 16 bytes, iv = 16 bytes)
- AES-256-CBC-ISO10126 (key = 32 bytes, iv = 16 bytes)
- AES-192-CBC-ISO10126 (key = 24 bytes, iv = 16 bytes)
- AES-128-CBC-ISO10126 (key = 16 bytes, iv = 16 bytes)
- CHACHA20-POLY1305 (key = 32 bytes, iv = 12 bytes)
- XCHACHA20-POLY1305 (key = 32 bytes, iv = 24 bytes)
- XSALSA20-POLY1305 (key = 32 bytes, iv = 24 bytes)
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| ciphertext | string | The string in raw bytes (not encoded) to decrypt. | yes | |
| algorithm | string | The algorithm to use. | yes | |
| key | string | The key in raw bytes (not encoded) for decryption. The length must match the algorithm requested. | yes | |
| iv | string | The IV in raw bytes (not encoded) for decryption. The length must match the algorithm requested.
A new IV should be generated for every message. You can use random_bytes to generate a cryptographically secure random value.
The value should match the one used during encryption. | yes |
Errors
Thedecrypt function is fallible, which means that
error handling is required for these errors:algorithm is not a supported algorithm.key length does not match the key size required for the algorithm specified.iv length does not match the iv size required for the algorithm specified.encrypt
fallible pureEncrypts a string with a symmetric encryption algorithm.
Supported Algorithms:
- AES-256-CFB (key = 32 bytes, iv = 16 bytes)
- AES-192-CFB (key = 24 bytes, iv = 16 bytes)
- AES-128-CFB (key = 16 bytes, iv = 16 bytes)
- AES-256-OFB (key = 32 bytes, iv = 16 bytes)
- AES-192-OFB (key = 24 bytes, iv = 16 bytes)
- AES-128-OFB (key = 16 bytes, iv = 16 bytes)
- AES-128-SIV (key = 32 bytes, iv = 16 bytes)
- AES-256-SIV (key = 64 bytes, iv = 16 bytes)
- Deprecated - AES-256-CTR (key = 32 bytes, iv = 16 bytes)
- Deprecated - AES-192-CTR (key = 24 bytes, iv = 16 bytes)
- Deprecated - AES-128-CTR (key = 16 bytes, iv = 16 bytes)
- AES-256-CTR-LE (key = 32 bytes, iv = 16 bytes)
- AES-192-CTR-LE (key = 24 bytes, iv = 16 bytes)
- AES-128-CTR-LE (key = 16 bytes, iv = 16 bytes)
- AES-256-CTR-BE (key = 32 bytes, iv = 16 bytes)
- AES-192-CTR-BE (key = 24 bytes, iv = 16 bytes)
- AES-128-CTR-BE (key = 16 bytes, iv = 16 bytes)
- AES-256-CBC-PKCS7 (key = 32 bytes, iv = 16 bytes)
- AES-192-CBC-PKCS7 (key = 24 bytes, iv = 16 bytes)
- AES-128-CBC-PKCS7 (key = 16 bytes, iv = 16 bytes)
- AES-256-CBC-ANSIX923 (key = 32 bytes, iv = 16 bytes)
- AES-192-CBC-ANSIX923 (key = 24 bytes, iv = 16 bytes)
- AES-128-CBC-ANSIX923 (key = 16 bytes, iv = 16 bytes)
- AES-256-CBC-ISO7816 (key = 32 bytes, iv = 16 bytes)
- AES-192-CBC-ISO7816 (key = 24 bytes, iv = 16 bytes)
- AES-128-CBC-ISO7816 (key = 16 bytes, iv = 16 bytes)
- AES-256-CBC-ISO10126 (key = 32 bytes, iv = 16 bytes)
- AES-192-CBC-ISO10126 (key = 24 bytes, iv = 16 bytes)
- AES-128-CBC-ISO10126 (key = 16 bytes, iv = 16 bytes)
- CHACHA20-POLY1305 (key = 32 bytes, iv = 12 bytes)
- XCHACHA20-POLY1305 (key = 32 bytes, iv = 24 bytes)
- XSALSA20-POLY1305 (key = 32 bytes, iv = 24 bytes)
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| plaintext | string | The string to encrypt. | yes | |
| algorithm | string | The algorithm to use. | yes | |
| key | string | The key in raw bytes (not encoded) for encryption. The length must match the algorithm requested. | yes | |
| iv | string | The IV in raw bytes (not encoded) for encryption. The length must match the algorithm requested.
A new IV should be generated for every message. You can use random_bytes to generate a cryptographically secure random value. | yes |
Errors
Theencrypt function is fallible, which means that
error handling is required for these errors:algorithm is not a supported algorithm.key length does not match the key size required for the algorithm specified.iv length does not match the iv size required for the algorithm specified.Examples
Encrypt value
plaintext = "super secret message"
iv = "1234567890123456" # typically you would call random_bytes(16)
key = "16_byte_keyxxxxx"
encrypted_message = encrypt!(plaintext, "AES-128-CBC-PKCS7", key, iv: iv)
encode_base64(encrypted_message)GBw8Mu00v0Kc38+/PvsVtGgWuUJ+ZNLgF8Opy8ohIYE=hmac
infallible pureCalculates a HMAC of the value using the given key.
The hashing algorithm used can be optionally specified.
For most use cases, the resulting bytestream should be encoded into a hex or base64 string using either encode_base16 or encode_base64.
This function is infallible if either the default algorithm value or a recognized-valid compile-time
algorithm string literal is used. Otherwise, it is fallible.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to calculate the HMAC for. | yes | |
| key | string | The string to use as the cryptographic key. | yes | |
| algorithm | string | The hashing algorithm to use. | SHA-256 | no |
Examples
Calculate message HMAC (defaults: SHA-256), encoding to a base64 string
encode_base64(hmac("Hello there", "super-secret-key"))eLGE8YMviv85NPXgISRUZxstBNSU47JQdcXkUWcClmI=md5
infallible purevalue.seahash
infallible purevalue.
Note: Due to limitations in the underlying VRL data types, this function converts the unsigned 64-bit integer SeaHash result to a signed 64-bit integer. Results higher than the signed 64-bit integer maximum value wrap around to negative values.sha1
infallible purevalue.sha2
infallible purevalue.sha3
infallible purevalue.IP functions
ip_aton
fallible pureConverts IPv4 address in numbers-and-dots notation into network-order bytes represented as an integer.
This behavior mimics inet_aton.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The IP address to convert to binary. | yes |
Errors
Theip_aton function is fallible, which means that
error handling is required for these errors:value is not a valid IPv4 address.ip_cidr_contains
fallible pureip is contained in the block referenced by the cidr.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| cidr | string | The CIDR mask (v4 or v6). | yes | |
| ip | string | The IP address (v4 or v6). | yes |
Errors
Theip_cidr_contains function is fallible, which means that
error handling is required for these errors:cidr is not a valid CIDR.ip is not a valid IP address.ip_ntoa
fallible pureConverts numeric representation of IPv4 address in network-order bytes to numbers-and-dots notation.
This behavior mimics inet_ntoa.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The integer representation of an IPv4 address. | yes |
Errors
Theip_ntoa function is fallible, which means that
error handling is required for these errors:value cannot fit in an unsigned 32-bit integer.ip_ntop
fallible pureConverts IPv4 and IPv6 addresses from binary to text form.
This behavior mimics inet_ntop.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The binary data to convert from. For IPv4 addresses, it must be 4 bytes (32 bits) long. For IPv6 addresses, it must be 16 bytes (128 bits) long. | yes |
Notices
This function has special behavior that you should be aware of.decode_base64 or
decode_percent can still be used correctly.Errors
Theip_ntop function is fallible, which means that
error handling is required for these errors:value must be of length 4 or 16 bytes.ip_pton
fallible pureConverts IPv4 and IPv6 addresses from text to binary form.
- The binary form of IPv4 addresses is 4 bytes (32 bits) long.
- The binary form of IPv6 addresses is 16 bytes (128 bits) long.
This behavior mimics inet_pton.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The IP address (v4 or v6) to convert to binary form. | yes |
Notices
This function has special behavior that you should be aware of.encode_base64 or encode_percent can
still process it correctly.Errors
Theip_pton function is fallible, which means that
error handling is required for these errors:value is not a valid IP (v4 or v6) address in text form.ip_subnet
fallible pureip using the supplied subnet.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| ip | string | The IP address (v4 or v6). | yes | |
| subnet | string | The subnet to extract from the IP address. This can be either a prefix length like /8 or a net mask
like 255.255.0.0. The net mask can be either an IPv4 or IPv6 address. | yes |
Notices
This function has special behavior that you should be aware of.Errors
Theip_subnet function is fallible, which means that
error handling is required for these errors:ip is not a valid IP address.subnet is not a valid subnet.ip_to_ipv6
fallible pureip to an IPv6 address.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| ip | string | The IP address to convert to IPv6. | yes |
Errors
Theip_to_ipv6 function is fallible, which means that
error handling is required for these errors:ip is not a valid IP address.ipv6_to_ipv4
fallible pureip to an IPv4 address. ip is returned unchanged if it’s already an IPv4 address. If ip is
currently an IPv6 address then it needs to be IPv4 compatible, otherwise an error is thrown.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| ip | string | The IPv4-mapped IPv6 address to convert. | yes |
Errors
Theipv6_to_ipv4 function is fallible, which means that
error handling is required for these errors:ip is not a valid IP address.ip is an IPv6 address that is not compatible with IPv4.is_ipv4
infallible pureCheck if the string is a valid IPv4 address or not.
An [IPv4-mapped][https://datatracker.ietf.org/doc/html/rfc6890] or [IPv4-compatible][https://datatracker.ietf.org/doc/html/rfc6890] IPv6 address is not considered valid for the purpose of this function.
is_ipv6
infallible pureNumber functions
abs
infallible purevalue.ceil
infallible purevalue up to the specified precision.floor
infallible purevalue down to the specified precision.format_int
fallible purevalue into a string representation using the given base/radix.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | integer | The number to format. | yes | |
| base | integer | The base to format the number in. Must be between 2 and 36 (inclusive). | 10 | no |
Errors
Theformat_int function is fallible, which means that
error handling is required for these errors:format_number
infallible purevalue into a string representation of the number.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | integer float | The number to format as a string. | yes | |
| scale | integer | The number of decimal places to display. | no | |
| decimal_separator | string | The character to use between the whole and decimal parts of the number. | . | no |
| grouping_separator | string | The character to use between each thousands part of the number. | , | no |
mod
fallible purevalue divided by modulus.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | integer float | The value the modulus is applied to. | yes | |
| modulus | integer float | The modulus value. | yes |
Errors
Themod function is fallible, which means that
error handling is required for these errors:value is not an integer or float.modulus is not an integer or float.modulus is equal to 0.round
infallible purevalue to the specified precision.Object functions
match_datadog_query
infallible puremerge
infallible purefrom object into the to object.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| to | object | The object to merge into. | yes | |
| from | object | The object to merge from. | yes | |
| deep | boolean | A deep merge is performed if true, otherwise only top-level fields are merged. | no |
unnest
fallible pureUnnest an array field from an object to create an array of objects using that field; keeping all other fields.
Assigning the array result of this to . results in multiple events being emitted from remap. See the
remap transform docs for more details.
This is also referred to as explode in some languages.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| path | path | The path of the field to unnest. | yes |
Errors
Theunnest function is fallible, which means that
error handling is required for these errors:Parse functions
parse_apache_log
fallible purecommon,
combined, or the default error format.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes | |
| format | string | The format to use for parsing the log. | yes | |
| timestamp_format | string | The date/time format to use for encoding the timestamp. The time is parsed in local time if the timestamp does not specify a timezone. | %d/%b/%Y:%T %z | no |
Notices
This function has special behavior that you should be aware of.-. These fields are omitted in the result.Errors
Theparse_apache_log function is fallible, which means that
error handling is required for these errors:value does not match the specified format.timestamp_format is not a valid format string.value fails to parse using the provided timestamp_format.Examples
Parse using Apache log format (common)
parse_apache_log!("127.0.0.1 bob frank [10/Oct/2000:13:55:36 -0700] \"GET /apache_pb.gif HTTP/1.0\" 200 2326", format: "common"){
"host": "127.0.0.1",
"identity": "bob",
"message": "GET /apache_pb.gif HTTP/1.0",
"method": "GET",
"path": "/apache_pb.gif",
"protocol": "HTTP/1.0",
"size": 2326,
"status": 200,
"timestamp": "2000-10-10T20:55:36Z",
"user": "frank"
}Parse using Apache log format (combined)
parse_apache_log!(
s'127.0.0.1 bob frank [10/Oct/2000:13:55:36 -0700] "GET /apache_pb.gif HTTP/1.0" 200 2326 "http://www.seniorinfomediaries.com/vertical/channels/front-end/bandwidth" "Mozilla/5.0 (X11; Linux i686; rv:5.0) Gecko/1945-10-12 Firefox/37.0"',
"combined",
){
"agent": "Mozilla/5.0 (X11; Linux i686; rv:5.0) Gecko/1945-10-12 Firefox/37.0",
"host": "127.0.0.1",
"identity": "bob",
"message": "GET /apache_pb.gif HTTP/1.0",
"method": "GET",
"path": "/apache_pb.gif",
"protocol": "HTTP/1.0",
"referrer": "http://www.seniorinfomediaries.com/vertical/channels/front-end/bandwidth",
"size": 2326,
"status": 200,
"timestamp": "2000-10-10T20:55:36Z",
"user": "frank"
}Parse using Apache log format (error)
parse_apache_log!(
s'[01/Mar/2021:12:00:19 +0000] [ab:alert] [pid 4803:tid 3814] [client 147.159.108.175:24259] I will bypass the haptic COM bandwidth, that should matrix the CSS driver!',
"error"
){
"client": "147.159.108.175",
"message": "I will bypass the haptic COM bandwidth, that should matrix the CSS driver!",
"module": "ab",
"pid": 4803,
"port": 24259,
"severity": "alert",
"thread": "3814",
"timestamp": "2021-03-01T12:00:19Z"
}parse_aws_alb_log
fallible purevalue in the Elastic Load Balancer Access format.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | Access log of the Application Load Balancer. | yes |
Errors
Theparse_aws_alb_log function is fallible, which means that
error handling is required for these errors:value is not a properly formatted AWS ALB log.Examples
Parse AWS ALB log
parse_aws_alb_log!(
"http 2018-11-30T22:23:00.186641Z app/my-loadbalancer/50dc6c495c0c9188 192.168.131.39:2817 - 0.000 0.001 0.000 200 200 34 366 \"GET http://www.example.com:80/ HTTP/1.1\" \"curl/7.46.0\" - - arn:aws:elasticloadbalancing:us-east-2:123456789012:targetgroup/my-targets/73e2d6bc24d8a067 \"Root=1-58337364-23a8c76965a2ef7629b185e3\" \"-\" \"-\" 0 2018-11-30T22:22:48.364000Z \"forward\" \"-\" \"-\" \"-\" \"-\" \"-\" \"-\""
){
"actions_executed": "forward",
"chosen_cert_arn": null,
"classification": null,
"classification_reason": null,
"client_host": "192.168.131.39:2817",
"domain_name": null,
"elb": "app/my-loadbalancer/50dc6c495c0c9188",
"elb_status_code": "200",
"error_reason": null,
"matched_rule_priority": "0",
"received_bytes": 34,
"redirect_url": null,
"request_creation_time": "2018-11-30T22:22:48.364000Z",
"request_method": "GET",
"request_processing_time": 0,
"request_protocol": "HTTP/1.1",
"request_url": "http://www.example.com:80/",
"response_processing_time": 0,
"sent_bytes": 366,
"ssl_cipher": null,
"ssl_protocol": null,
"target_group_arn": "arn:aws:elasticloadbalancing:us-east-2:123456789012:targetgroup/my-targets/73e2d6bc24d8a067",
"target_host": null,
"target_port_list": [],
"target_processing_time": 0.001,
"target_status_code": "200",
"target_status_code_list": [],
"timestamp": "2018-11-30T22:23:00.186641Z",
"trace_id": "Root=1-58337364-23a8c76965a2ef7629b185e3",
"traceability_id": null,
"type": "http",
"user_agent": "curl/7.46.0"
}parse_aws_cloudwatch_log_subscription_message
fallible pureaws_kinesis_firehose source.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string representation of the message to parse. | yes |
Errors
Theparse_aws_cloudwatch_log_subscription_message function is fallible, which means that
error handling is required for these errors:value is not a properly formatted AWS CloudWatch Log subscription message.Examples
Parse AWS Cloudwatch Log subscription message
parse_aws_cloudwatch_log_subscription_message!(.message){
"log_events": [
{
"id": "35683658089614582423604394983260738922885519999578275840",
"message": "{\"bytes\":26780,\"datetime\":\"14/Sep/2020:11:45:41 -0400\",\"host\":\"157.130.216.193\",\"method\":\"PUT\",\"protocol\":\"HTTP/1.0\",\"referer\":\"https://www.principalcross-platform.io/markets/ubiquitous\",\"request\":\"/expedite/convergence\",\"source_type\":\"stdin\",\"status\":301,\"user-identifier\":\"-\"}",
"timestamp": "2020-09-14T19:09:29.039Z"
}
],
"log_group": "test",
"log_stream": "test",
"message_type": "DATA_MESSAGE",
"owner": "111111111111",
"subscription_filters": [
"Destination"
]
}parse_aws_vpc_flow_log
fallible purevalue in the VPC Flow Logs format.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | VPC Flow Log. | yes | |
| format | string | VPC Flow Log format. | no |
Errors
Theparse_aws_vpc_flow_log function is fallible, which means that
error handling is required for these errors:value is not a properly formatted AWS VPC Flow log.Examples
Parse AWS VPC Flow log (default format)
parse_aws_vpc_flow_log!("2 123456789010 eni-1235b8ca123456789 - - - - - - - 1431280876 1431280934 - NODATA"){
"account_id": "123456789010",
"action": null,
"bytes": null,
"dstaddr": null,
"dstport": null,
"end": 1431280934,
"interface_id": "eni-1235b8ca123456789",
"log_status": "NODATA",
"packets": null,
"protocol": null,
"srcaddr": null,
"srcport": null,
"start": 1431280876,
"version": 2
}Parse AWS VPC Flow log (custom format)
parse_aws_vpc_flow_log!(
"- eni-1235b8ca123456789 10.0.1.5 10.0.0.220 10.0.1.5 203.0.113.5",
"instance_id interface_id srcaddr dstaddr pkt_srcaddr pkt_dstaddr"
){
"dstaddr": "10.0.0.220",
"instance_id": null,
"interface_id": "eni-1235b8ca123456789",
"pkt_dstaddr": "203.0.113.5",
"pkt_srcaddr": "10.0.1.5",
"srcaddr": "10.0.1.5"
}Parse AWS VPC Flow log including v5 fields
parse_aws_vpc_flow_log!("5 52.95.128.179 10.0.0.71 80 34210 6 1616729292 1616729349 IPv4 14 15044 123456789012 vpc-abcdefab012345678 subnet-aaaaaaaa012345678 i-0c50d5961bcb2d47b eni-1235b8ca123456789 ap-southeast-2 apse2-az3 - - ACCEPT 19 52.95.128.179 10.0.0.71 S3 - - ingress OK",
format: "version srcaddr dstaddr srcport dstport protocol start end type packets bytes account_id vpc_id subnet_id instance_id interface_id region az_id sublocation_type sublocation_id action tcp_flags pkt_srcaddr pkt_dstaddr pkt_src_aws_service pkt_dst_aws_service traffic_path flow_direction log_status"){
"account_id": "123456789012",
"action": "ACCEPT",
"az_id": "apse2-az3",
"bytes": 15044,
"dstaddr": "10.0.0.71",
"dstport": 34210,
"end": 1616729349,
"flow_direction": "ingress",
"instance_id": "i-0c50d5961bcb2d47b",
"interface_id": "eni-1235b8ca123456789",
"log_status": "OK",
"packets": 14,
"pkt_dst_aws_service": null,
"pkt_dstaddr": "10.0.0.71",
"pkt_src_aws_service": "S3",
"pkt_srcaddr": "52.95.128.179",
"protocol": 6,
"region": "ap-southeast-2",
"srcaddr": "52.95.128.179",
"srcport": 80,
"start": 1616729292,
"sublocation_id": null,
"sublocation_type": null,
"subnet_id": "subnet-aaaaaaaa012345678",
"tcp_flags": 19,
"traffic_path": null,
"type": "IPv4",
"version": 5,
"vpc_id": "vpc-abcdefab012345678"
}parse_cef
fallible purevalue in CEF (Common Event Format) format. Ignores everything up to CEF header. Empty values are returned as empty strings. Surrounding quotes are removed from values.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes | |
| translate_custom_fields | boolean | Toggles translation of custom field pairs to key:value. | no |
Notices
This function has special behavior that you should be aware of.Errors
Theparse_cef function is fallible, which means that
error handling is required for these errors:value is not a properly formatted CEF string.Examples
Parse output generated by PTA
parse_cef!(
"CEF:0|CyberArk|PTA|12.6|1|Suspected credentials theft|8|suser=mike2@prod1.domain.com shost=prod1.domain.com src=1.1.1.1 duser=andy@dev1.domain.com dhost=dev1.domain.com dst=2.2.2.2 cs1Label=ExtraData cs1=None cs2Label=EventID cs2=52b06812ec3500ed864c461e deviceCustomDate1Label=detectionDate deviceCustomDate1=1388577900000 cs3Label=PTAlink cs3=https://1.1.1.1/incidents/52b06812ec3500ed864c461e cs4Label=ExternalLink cs4=None"
){
"cefVersion": "0",
"cs1": "None",
"cs1Label": "ExtraData",
"cs2": "52b06812ec3500ed864c461e",
"cs2Label": "EventID",
"cs3": "https://1.1.1.1/incidents/52b06812ec3500ed864c461e",
"cs3Label": "PTAlink",
"cs4": "None",
"cs4Label": "ExternalLink",
"deviceCustomDate1": "1388577900000",
"deviceCustomDate1Label": "detectionDate",
"deviceEventClassId": "1",
"deviceProduct": "PTA",
"deviceVendor": "CyberArk",
"deviceVersion": "12.6",
"dhost": "dev1.domain.com",
"dst": "2.2.2.2",
"duser": "andy@dev1.domain.com",
"name": "Suspected credentials theft",
"severity": "8",
"shost": "prod1.domain.com",
"src": "1.1.1.1",
"suser": "mike2@prod1.domain.com"
}Ignore syslog header
parse_cef!(
"Sep 29 08:26:10 host CEF:1|Security|threatmanager|1.0|100|worm successfully stopped|10|src=10.0.0.1 dst=2.1.2.2 spt=1232"
){
"cefVersion": "1",
"deviceEventClassId": "100",
"deviceProduct": "threatmanager",
"deviceVendor": "Security",
"deviceVersion": "1.0",
"dst": "2.1.2.2",
"name": "worm successfully stopped",
"severity": "10",
"spt": "1232",
"src": "10.0.0.1"
}Translate custom fields
parse_cef!(
"CEF:0|Dev|firewall|2.2|1|Connection denied|5|c6a1=2345:0425:2CA1:0000:0000:0567:5673:23b5 c6a1Label=Device IPv6 Address",
translate_custom_fields: true
){
"Device IPv6 Address": "2345:0425:2CA1:0000:0000:0567:5673:23b5",
"cefVersion": "0",
"deviceEventClassId": "1",
"deviceProduct": "firewall",
"deviceVendor": "Dev",
"deviceVersion": "2.2",
"name": "Connection denied",
"severity": "5"
}parse_common_log
fallible purevalue using the Common Log Format (CLF).Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes | |
| timestamp_format | string | The date/time format to use for encoding the timestamp. | %d/%b/%Y:%T %z | no |
Notices
This function has special behavior that you should be aware of.-. These fields are omitted in the result.Errors
Theparse_common_log function is fallible, which means that
error handling is required for these errors:value does not match the Common Log Format.timestamp_format is not a valid format string.value fails to parse using the provided timestamp_format.Examples
Parse using Common Log Format (with default timestamp format)
parse_common_log!("127.0.0.1 bob frank [10/Oct/2000:13:55:36 -0700] \"GET /apache_pb.gif HTTP/1.0\" 200 2326"){
"host": "127.0.0.1",
"identity": "bob",
"message": "GET /apache_pb.gif HTTP/1.0",
"method": "GET",
"path": "/apache_pb.gif",
"protocol": "HTTP/1.0",
"size": 2326,
"status": 200,
"timestamp": "2000-10-10T20:55:36Z",
"user": "frank"
}Parse using Common Log Format (with custom timestamp format)
parse_common_log!(
"127.0.0.1 bob frank [2000-10-10T20:55:36Z] \"GET /apache_pb.gif HTTP/1.0\" 200 2326",
"%+"
){
"host": "127.0.0.1",
"identity": "bob",
"message": "GET /apache_pb.gif HTTP/1.0",
"method": "GET",
"path": "/apache_pb.gif",
"protocol": "HTTP/1.0",
"size": 2326,
"status": 200,
"timestamp": "2000-10-10T20:55:36Z",
"user": "frank"
}parse_csv
fallible pureFunction spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes | |
| delimiter | string | The field delimiter to use when parsing. Must be a single-byte utf8 character. | , | no |
Notices
This function has special behavior that you should be aware of.Errors
Theparse_csv function is fallible, which means that
error handling is required for these errors:value is not a valid CSV string.parse_duration
fallible purevalue into a human-readable duration format specified by unit.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string of the duration. | yes | |
| unit | string | The output units for the duration. | yes |
Errors
Theparse_duration function is fallible, which means that
error handling is required for these errors:value is not a properly formatted duration.parse_etld
fallible purevalue representing domain name.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The domain string. | yes | |
| plus_parts | integer | Can be provided to get additional parts of the domain name. When 1 is passed, eTLD+1 will be returned, which represents a domain registrable by a single organization. Higher numbers will return subdomains. | no | |
| psl | string | Can be provided to use a different public suffix list. By default, https://publicsuffix.org/list/public_suffix_list.dat is used. | no |
Errors
Theparse_etld function is fallible, which means that
error handling is required for these errors:valueExamples
Parse eTLD
parse_etld!("sub.sussex.ac.uk"){
"etld": "ac.uk",
"etld_plus": "ac.uk",
"known_suffix": true
}Parse eTLD+1
parse_etld!("sub.sussex.ac.uk", plus_parts: 1){
"etld": "ac.uk",
"etld_plus": "sussex.ac.uk",
"known_suffix": true
}parse_glog
fallible purevalue using the glog (Google Logging Library) format.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes |
Errors
Theparse_glog function is fallible, which means that
error handling is required for these errors:value does not match the glog format.parse_grok
fallible pureFunction spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes | |
| pattern | string | The Grok pattern. | yes |
Notices
This function has special behavior that you should be aware of.Errors
Theparse_grok function is fallible, which means that
error handling is required for these errors:value fails to parse using the provided pattern.parse_groks
fallible pureFunction spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes | |
| patterns | array | The Grok patterns, which are tried in order until the first match. | yes | |
| aliases | object | The shared set of grok aliases that can be referenced in the patterns to simplify them. | true | no |
Notices
This function has special behavior that you should be aware of.Errors
Theparse_groks function is fallible, which means that
error handling is required for these errors:value fails to parse using the provided pattern.Examples
Parse using multiple Grok patterns
parse_groks!(
"2020-10-02T23:22:12.223222Z info Hello world",
patterns: [
"%{common_prefix} %{_status} %{_message}",
"%{common_prefix} %{_message}",
],
aliases: {
"common_prefix": "%{_timestamp} %{_loglevel}",
"_timestamp": "%{TIMESTAMP_ISO8601:timestamp}",
"_loglevel": "%{LOGLEVEL:level}",
"_status": "%{POSINT:status}",
"_message": "%{GREEDYDATA:message}"
}
){
"level": "info",
"message": "Hello world",
"timestamp": "2020-10-02T23:22:12.223222Z"
}parse_influxdb
fallible purevalue as an InfluxDB line protocol
string, producing a list of Vector-compatible metrics.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string representation of the InfluxDB line protocol to parse. | yes |
Notices
This function has special behavior that you should be aware of.log_to_metric transform with the option all_metrics
set to true to convert the metric-shaped log events to metric events so real metrics are produced.gauge. Each metric name is prefixed with the measurement field, followed
by an underscore (_), and then the field key field.string is the only type that is not supported as a field value,
due to limitations of Vector’s metric model.Errors
Theparse_influxdb function is fallible, which means that
error handling is required for these errors:value is not a valid InfluxDB line protocol string.string.NaN field value.Examples
Parse InfluxDB line protocol
parse_influxdb!("cpu,host=A,region=us-west usage_system=64i,usage_user=10u,temperature=50.5,on=true,sleep=false 1590488773254420000")[{"gauge":{"value":64},"kind":"absolute","name":"cpu_usage_system","tags":{"host":"A","region":"us-west"},"timestamp":"2020-05-26T10:26:13.254420Z"},{"gauge":{"value":10},"kind":"absolute","name":"cpu_usage_user","tags":{"host":"A","region":"us-west"},"timestamp":"2020-05-26T10:26:13.254420Z"},{"gauge":{"value":50.5},"kind":"absolute","name":"cpu_temperature","tags":{"host":"A","region":"us-west"},"timestamp":"2020-05-26T10:26:13.254420Z"},{"gauge":{"value":1},"kind":"absolute","name":"cpu_on","tags":{"host":"A","region":"us-west"},"timestamp":"2020-05-26T10:26:13.254420Z"},{"gauge":{"value":0},"kind":"absolute","name":"cpu_sleep","tags":{"host":"A","region":"us-west"},"timestamp":"2020-05-26T10:26:13.254420Z"}]parse_int
fallible purevalue representing a number in an optional base/radix to an integer.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes | |
| base | integer | The base the number is in. Must be between 2 and 36 (inclusive). If unspecified, the string prefix is used to determine the base: “0b”, 8 for “0” or “0o”, 16 for “0x”, and 10 otherwise. | no |
Errors
Theparse_int function is fallible, which means that
error handling is required for these errors:parse_json
fallible purevalue as JSON.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string representation of the JSON to parse. | yes | |
| max_depth | integer | Number of layers to parse for nested JSON-formatted documents. The value must be in the range of 1 to 128. | no | |
| lossy | boolean | Whether to parse the JSON in a lossy manner. Replaces invalid UTF-8 characters
with the Unicode character � (U+FFFD) if set to true, otherwise returns an error
if there are any invalid UTF-8 characters present. | true | no |
Notices
This function has special behavior that you should be aware of.string into a timestamp, consider the
parse_timestamp function.Errors
Theparse_json function is fallible, which means that
error handling is required for these errors:value is not a valid JSON-formatted payload.parse_key_value
fallible pureParses the value in key-value format. Also known as logfmt.
- Keys and values can be wrapped with
". "characters can be escaped using\.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes | |
| key_value_delimiter | string | The string that separates the key from the value. | = | no |
| field_delimiter | string | The string that separates each key-value pair. | | no |
| whitespace | string | Defines the acceptance of unnecessary whitespace surrounding the configured key_value_delimiter. | lenient | no |
| accept_standalone_key | boolean | Whether a standalone key should be accepted, the resulting object associates such keys with the boolean value true. | true | no |
Notices
This function has special behavior that you should be aware of.Errors
Theparse_key_value function is fallible, which means that
error handling is required for these errors:value is not a properly formatted key-value string.Examples
Parse logfmt log
parse_key_value!(
"@timestamp=\"Sun Jan 10 16:47:39 EST 2021\" level=info msg=\"Stopping all fetchers\" tag#production=stopping_fetchers id=ConsumerFetcherManager-1382721708341 module=kafka.consumer.ConsumerFetcherManager"
){
"@timestamp": "Sun Jan 10 16:47:39 EST 2021",
"id": "ConsumerFetcherManager-1382721708341",
"level": "info",
"module": "kafka.consumer.ConsumerFetcherManager",
"msg": "Stopping all fetchers",
"tag#production": "stopping_fetchers"
}Parse comma delimited log
parse_key_value!(
"path:\"/cart_link\", host:store.app.com, fwd: \"102.30.171.16\", dyno: web.1, connect:0ms, service:87ms, status:304, bytes:632, protocol:https",
field_delimiter: ",",
key_value_delimiter: ":"
){
"bytes": "632",
"connect": "0ms",
"dyno": "web.1",
"fwd": "102.30.171.16",
"host": "store.app.com",
"path": "/cart_link",
"protocol": "https",
"service": "87ms",
"status": "304"
}parse_klog
fallible purevalue using the klog format used by Kubernetes components.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes |
Errors
Theparse_klog function is fallible, which means that
error handling is required for these errors:value does not match the klog format.parse_linux_authorization
fallible pure/var/log/auth.log (for Debian-based systems) or
/var/log/secure (for RedHat-based systems) according to Syslog format.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The text containing the message to parse. | yes |
Notices
This function has special behavior that you should be aware of.Errors
Theparse_linux_authorization function is fallible, which means that
error handling is required for these errors:value is not a properly formatted Syslog message.Examples
Parse Linux authorization event
parse_linux_authorization!(
s'Mar 23 2023 01:49:58 localhost sshd[1111]: Accepted publickey for eng from 10.1.1.1 port 8888 ssh2: RSA SHA256:foobar'
){
"appname": "sshd",
"hostname": "localhost",
"message": "Accepted publickey for eng from 10.1.1.1 port 8888 ssh2: RSA SHA256:foobar",
"procid": 1111,
"timestamp": "2023-03-23T01:49:58Z"
}parse_logfmt
fallible pureParses the value in logfmt.
- Keys and values can be wrapped using the
"character. "characters can be escaped by the\character.- As per this logfmt specification, the
parse_logfmtfunction accepts standalone keys and assigns them a Boolean value oftrue.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes |
Errors
Theparse_logfmt function is fallible, which means that
error handling is required for these errors:value is not a properly formatted key-value stringExamples
Parse logfmt log
parse_logfmt!(
"@timestamp=\"Sun Jan 10 16:47:39 EST 2021\" level=info msg=\"Stopping all fetchers\" tag#production=stopping_fetchers id=ConsumerFetcherManager-1382721708341 module=kafka.consumer.ConsumerFetcherManager"
){
"@timestamp": "Sun Jan 10 16:47:39 EST 2021",
"id": "ConsumerFetcherManager-1382721708341",
"level": "info",
"module": "kafka.consumer.ConsumerFetcherManager",
"msg": "Stopping all fetchers",
"tag#production": "stopping_fetchers"
}parse_nginx_log
fallible purecombined,
ingress_upstreaminfo, or error format.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes | |
| format | string | The format to use for parsing the log. | yes | |
| timestamp_format | string | The date/time format to use for encoding the timestamp. The time is parsed
in local time if the timestamp doesn’t specify a timezone. The default format is %d/%b/%Y:%T %z for
combined logs and %Y/%m/%d %H:%M:%S for error logs. | %d/%b/%Y:%T %z | no |
Notices
This function has special behavior that you should be aware of.-. These fields are omitted in the result.ingress_upstreaminfo format the following fields may be safely omitted in the log message: remote_addr, remote_user, http_referer, http_user_agent, proxy_alternative_upstream_name, upstream_addr, upstream_response_length, upstream_response_time, upstream_status.Errors
Theparse_nginx_log function is fallible, which means that
error handling is required for these errors:value does not match the specified format.timestamp_format is not a valid format string.value fails to parse using the provided timestamp_format.Examples
Parse via Nginx log format (combined)
parse_nginx_log!(
s'172.17.0.1 - alice [01/Apr/2021:12:02:31 +0000] "POST /not-found HTTP/1.1" 404 153 "http://localhost/somewhere" "Mozilla/5.0 (Windows NT 6.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/72.0.3626.119 Safari/537.36" "2.75"',
"combined",
){
"agent": "Mozilla/5.0 (Windows NT 6.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/72.0.3626.119 Safari/537.36",
"client": "172.17.0.1",
"compression": "2.75",
"referer": "http://localhost/somewhere",
"request": "POST /not-found HTTP/1.1",
"size": 153,
"status": 404,
"timestamp": "2021-04-01T12:02:31Z",
"user": "alice"
}Parse via Nginx log format (error)
parse_nginx_log!(
s'2021/04/01 13:02:31 [error] 31#31: *1 open() "/usr/share/nginx/html/not-found" failed (2: No such file or directory), client: 172.17.0.1, server: localhost, request: "POST /not-found HTTP/1.1", host: "localhost:8081"',
"error"
){
"cid": 1,
"client": "172.17.0.1",
"host": "localhost:8081",
"message": "open() \"/usr/share/nginx/html/not-found\" failed (2: No such file or directory)",
"pid": 31,
"request": "POST /not-found HTTP/1.1",
"server": "localhost",
"severity": "error",
"tid": 31,
"timestamp": "2021-04-01T13:02:31Z"
}Parse via Nginx log format (ingress_upstreaminfo)
parse_nginx_log!(
s'0.0.0.0 - bob [18/Mar/2023:15:00:00 +0000] "GET /some/path HTTP/2.0" 200 12312 "https://10.0.0.1/some/referer" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/111.0.0.0 Safari/537.36" 462 0.050 [some-upstream-service-9000] [some-other-upstream-5000] 10.0.50.80:9000 19437 0.049 200 752178adb17130b291aefd8c386279e7',
"ingress_upstreaminfo"
){
"body_bytes_size": 12312,
"http_referer": "https://10.0.0.1/some/referer",
"http_user_agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/111.0.0.0 Safari/537.36",
"proxy_alternative_upstream_name": "some-other-upstream-5000",
"proxy_upstream_name": "some-upstream-service-9000",
"remote_addr": "0.0.0.0",
"remote_user": "bob",
"req_id": "752178adb17130b291aefd8c386279e7",
"request": "GET /some/path HTTP/2.0",
"request_length": 462,
"request_time": 0.05,
"status": 200,
"timestamp": "2023-03-18T15:00:00Z",
"upstream_addr": "10.0.50.80:9000",
"upstream_response_length": 19437,
"upstream_response_time": 0.049,
"upstream_status": 200
}parse_proto
fallible purevalue as a protocol buffer payload.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The protocol buffer payload to parse. | yes | |
| desc_file | string | The path to the protobuf descriptor set file. Must be a literal string. This file is the output of protoc -o | yes | |
| message_type | string | The name of the message type to use for serializing. Must be a literal string. | yes |
Notices
This function has special behavior that you should be aware of.Only proto messages are parsed and returned.Errors
Theparse_proto function is fallible, which means that
error handling is required for these errors:value is not a valid proto payload.desc_file file does not exist.message_type message type does not exist in the descriptor file.parse_query_string
infallible purevalue as a query string.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes |
parse_regex
fallible pureParses the value using the provided Regex pattern.
This function differs from the parse_regex_all function in that it returns only the first match.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to search. | yes | |
| pattern | regex | The regular expression pattern to search against. | yes | |
| numeric_groups | regex | If true, the index of each group in the regular expression is also captured. Index 0
contains the whole match. | no |
Notices
This function has special behavior that you should be aware of.parse_regex function, see if a VRL parse_* function
already exists for your format. If not, we recommend opening an issue to request
support for the desired format.Errors
Theparse_regex function is fallible, which means that
error handling is required for these errors:value fails to parse using the provided pattern.parse_regex_all
fallible pureParses the value using the provided Regex pattern.
This function differs from the parse_regex function in that it returns all matches, not just the first.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to search. | yes | |
| pattern | regex | The regular expression pattern to search against. | yes | |
| numeric_groups | regex | If true, the index of each group in the regular expression is also captured. Index 0
contains the whole match. | no |
Notices
This function has special behavior that you should be aware of.parse_regex function, see if a VRL parse_* function
already exists for your format. If not, we recommend opening an issue to request
support for the desired format.Errors
Theparse_regex_all function is fallible, which means that
error handling is required for these errors:value fails to parse using the provided pattern.parse_ruby_hash
fallible purevalue as ruby hash.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string representation of the ruby hash to parse. | yes |
Notices
This function has special behavior that you should be aware of.string into a timestamp, consider the
parse_timestamp function.Errors
Theparse_ruby_hash function is fallible, which means that
error handling is required for these errors:value is not a valid ruby hash formatted payload.parse_syslog
fallible purevalue in Syslog format.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The text containing the Syslog message to parse. | yes |
Notices
This function has special behavior that you should be aware of.Errors
Theparse_syslog function is fallible, which means that
error handling is required for these errors:value is not a properly formatted Syslog message.Examples
Parse Syslog log (5424)
parse_syslog!(
s'<13>1 2020-03-13T20:45:38.119Z dynamicwireless.name non 2426 ID931 [exampleSDID@32473 iut="3" eventSource= "Application" eventID="1011"] Try to override the THX port, maybe it will reboot the neural interface!'
){
"appname": "non",
"exampleSDID@32473": {
"eventID": "1011",
"eventSource": "Application",
"iut": "3"
},
"facility": "user",
"hostname": "dynamicwireless.name",
"message": "Try to override the THX port, maybe it will reboot the neural interface!",
"msgid": "ID931",
"procid": 2426,
"severity": "notice",
"timestamp": "2020-03-13T20:45:38.119Z",
"version": 1
}parse_timestamp
fallible pureFunction spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The text of the timestamp. | yes | |
| format | string | The strptime format. | yes |
Errors
Theparse_timestamp function is fallible, which means that
error handling is required for these errors:value fails to parse using the provided format.parse_tokens
fallible pureParses the value in token format. A token is considered to be one of the following:
- A word surrounded by whitespace.
- Text delimited by double quotes:
"..". Quotes can be included in the token if they are escaped by a backslash (\). - Text delimited by square brackets:
[..]. Closing square brackets can be included in the token if they are escaped by a backslash (\).
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to tokenize. | yes |
Notices
This function has special behavior that you should be aware of.Errors
Theparse_tokens function is fallible, which means that
error handling is required for these errors:value is not a properly formatted tokenized string.parse_url
fallible purevalue in URL format.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The text of the URL. | yes | |
| default_known_ports | boolean | If true and the port number is not specified in the input URL
string (or matches the default port for the scheme), it is
populated from well-known ports for the following schemes:
http, https, ws, wss, and ftp. | no |
Errors
Theparse_url function is fallible, which means that
error handling is required for these errors:value is not a properly formatted URL.Examples
Parse URL
parse_url!("ftp://foo:bar@example.com:4343/foobar?hello=world#123"){
"fragment": "123",
"host": "example.com",
"password": "bar",
"path": "/foobar",
"port": 4343,
"query": {
"hello": "world"
},
"scheme": "ftp",
"username": "foo"
}Parse URL with default port
parse_url!("https://example.com", default_known_ports: true){
"fragment": null,
"host": "example.com",
"password": "",
"path": "/",
"port": 443,
"query": {},
"scheme": "https",
"username": ""
}parse_user_agent
infallible purevalue as a user agent string, which has a loosely defined format
so this parser only provides best effort guarantee.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to parse. | yes | |
| mode | string | Determines performance and reliability characteristics. | fast | no |
Notices
This function has special behavior that you should be aware of.null.Examples
Fast mode
parse_user_agent(
"Mozilla Firefox 1.0.1 Mozilla/5.0 (X11; U; Linux i686; de-DE; rv:1.7.6) Gecko/20050223 Firefox/1.0.1"
){
"browser": {
"family": "Firefox",
"version": "1.0.1"
},
"device": {
"category": "pc"
},
"os": {
"family": "Linux",
"version": null
}
}Reliable mode
parse_user_agent(
"Mozilla/4.0 (compatible; MSIE 7.66; Windows NT 5.1; SV1; .NET CLR 1.1.4322)",
mode: "reliable"
){
"browser": {
"family": "Internet Explorer",
"version": "7.66"
},
"device": {
"category": "pc"
},
"os": {
"family": "Windows XP",
"version": "NT 5.1"
}
}Enriched mode
parse_user_agent(
"Opera/9.80 (J2ME/MIDP; Opera Mini/4.3.24214; iPhone; CPU iPhone OS 4_2_1 like Mac OS X; AppleWebKit/24.783; U; en) Presto/2.5.25 Version/10.54",
mode: "enriched"
){
"browser": {
"family": "Opera Mini",
"major": "4",
"minor": "3",
"patch": "24214",
"version": "10.54"
},
"device": {
"brand": "Apple",
"category": "smartphone",
"family": "iPhone",
"model": "iPhone"
},
"os": {
"family": "iOS",
"major": "4",
"minor": "2",
"patch": "1",
"patch_minor": null,
"version": "4.2.1"
}
}parse_xml
fallible purevalue as XML.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string representation of the XML document to parse. | yes | |
| include_attr | boolean | Include XML tag attributes in the returned object. | true | no |
| attr_prefix | string | String prefix to use for XML tag attribute keys. | @ | no |
| text_key | string | Key name to use for expanded text nodes. | text | no |
| always_use_text_key | boolean | Always return text nodes as {"<text_key>": "value"}. | no | |
| parse_bool | boolean | Parse “true” and “false” as boolean. | true | no |
| parse_null | boolean | Parse “null” as null. | true | no |
| parse_number | boolean | Parse numbers as integers/floats. | true | no |
Notices
This function has special behavior that you should be aware of.Errors
Theparse_xml function is fallible, which means that
error handling is required for these errors:value is not a valid XML document.Examples
Parse XML
value = s'<book category="CHILDREN"><title lang="en">Harry Potter</title><author>J K. Rowling</author><year>2005</year></book>';
parse_xml!(value, text_key: "value", parse_number: false){
"book": {
"@category": "CHILDREN",
"author": "J K. Rowling",
"title": {
"@lang": "en",
"value": "Harry Potter"
},
"year": "2005"
}
}Random functions
random_bool
infallible purerandom_bytes
fallible pureFunction spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| length | integer | The number of bytes to generate. Must not be larger than 64k. | yes |
Errors
Therandom_bytes function is fallible, which means that
error handling is required for these errors:length is negative.length is larger than the maximum value (64k).random_float
fallible pureFunction spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| min | float | Minimum value (inclusive). | yes | |
| max | float | Maximum value (exclusive). | yes |
Errors
Therandom_float function is fallible, which means that
error handling is required for these errors:max is not greater than min.random_int
fallible pureFunction spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| min | integer | Minimum value (inclusive). | yes | |
| max | integer | Maximum value (exclusive). | yes |
Errors
Therandom_int function is fallible, which means that
error handling is required for these errors:max is not greater than min.uuid_from_friendly_id
fallible pureFunction spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | timestamp | A string that is a Friendly ID | yes |
Errors
Theuuid_from_friendly_id function is fallible, which means that
error handling is required for these errors:value is a string but the text uses characters outside of class [0-9A-Za-z].value is a base62 encoding of an integer, but the integer is greater than or equal to 2^128.uuid_v4
infallible pureuuid_v7
infallible pureString functions
camelcase
infallible purevalue string, and turns it into camelCase. Optionally, you can
pass in the existing case of the function, or else an attempt is made to determine the case automatically.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to convert to camelCase. | yes | |
| original_case | string | Optional hint on the original case type. Must be one of: kebab-case, camelCase, PascalCase, SCREAMING_SNAKE, snake_case | no |
community_id
infallible pureFunction spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| source_ip | string | The source IP address. | yes | |
| destination_ip | string | The destination IP address. | yes | |
| protocol | integer | The protocol number. | yes | |
| source_port | integer | The source port or ICMP type. | no | |
| destination_port | integer | The destination port or ICMP code. | no | |
| seed | integer | The custom seed number. | no |
contains
infallible purevalue string contains the specified substring.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The text to search. | yes | |
| substring | string | The substring to search for in value. | yes | |
| case_sensitive | boolean | Whether the match should be case sensitive. | true | no |
contains_all
infallible purevalue string contains all the specified substrings.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The text to search. | yes | |
| substrings | array | An array of substrings to search for in value. | yes | |
| case_sensitive | boolean | Whether the match should be case sensitive. | no |
downcase
infallible purevalue string, where downcase is defined according to the
Unicode Derived Core Property Lowercase.ends_with
infallible purevalue string ends with the specified substring.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to search. | yes | |
| substring | string | The substring with which value must end. | yes | |
| case_sensitive | boolean | Whether the match should be case sensitive. | true | no |
find
infallible purevalue
that matches pattern. Returns -1 if not found.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to find the pattern in. | yes | |
| pattern | regex string | The regular expression or string pattern to match against. | yes | |
| from | integer | Offset to start searching. | no |
join
infallible purevalue array into a single string, with items optionally separated from one another
by a separator.kebabcase
infallible purevalue string, and turns it into kebab-case. Optionally, you can
pass in the existing case of the function, or else we will try to figure out the case automatically.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to convert to kebab-case. | yes | |
| original_case | string | Optional hint on the original case type. Must be one of: kebab-case, camelCase, PascalCase, SCREAMING_SNAKE, snake_case | no |
match
infallible purevalue matches the pattern.match_any
infallible purevalue matches any of the given patterns. All
patterns are checked in a single pass over the target string, giving this
function a potential performance advantage over the multiple calls
in the match function.parse_float
infallible purevalue representing a floating point number in base 10 to a float.pascalcase
infallible purevalue string, and turns it into PascalCase. Optionally, you can
pass in the existing case of the function, or else we will try to figure out the case automatically.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to convert to PascalCase. | yes | |
| original_case | string | Optional hint on the original case type. Must be one of: kebab-case, camelCase, PascalCase, SCREAMING_SNAKE, snake_case | no |
redact
infallible pureRedact sensitive data in value such as:
- US social security card numbers
- Other forms of personally identifiable information with custom patterns
This can help achieve compliance by ensuring sensitive data does not leave your network.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string object array | The value to redact sensitive data from. The function’s behavior depends on
For arrays and objects, the function recurses into any nested arrays or objects. Any non-string elements are skipped. Redacted text is replaced with | yes | |
| filters | array | List of filters applied to Each filter can be specified in the following ways:
Named filters can be a:
See examples for more details. This parameter must be a static expression so that the argument can be validated at compile-time to avoid runtime errors. You cannot use variables or other dynamic expressions with it. | yes | |
| redactor | string object | Specifies what to replace the redacted strings with. It is given as an object with a “type” key specifying the type of redactor to use and additional keys depending on the type. The following types are supported:
As a convenience you can use a string as a shorthand for common redactor patterns:
This parameter must be a static expression so that the argument can be validated at compile-time to avoid runtime errors. You cannot use variables or other dynamic expressions with it. | no |
Examples
Replace text using a regex
redact("my id is 123456", filters: [r'\d+'])my id is [REDACTED]Replace us social security numbers in any field
redact({ "name": "John Doe", "ssn": "123-12-1234"}, filters: ["us_social_security_number"]){
"name": "John Doe",
"ssn": "[REDACTED]"
}Replace with custom text
redact("my id is 123456", filters: [r'\d+'], redactor: {"type": "text", "replacement": "***"})my id is ***Replace with SHA-2 hash
redact("my id is 123456", filters: [r'\d+'], redactor: "sha2")my id is GEtTedW1p6tC094dDKH+3B8P+xSnZz69AmpjaXRd63I=replace
infallible pureReplaces all matching instances of pattern in value.
The pattern argument accepts regular expression capture groups.
Note when using capture groups:
- You will need to escape the
$by using$$to avoid Vector interpreting it as an environment variable when loading configuration - If you want a literal
$in the replacement pattern, you will also need to escape this with$$. When combined with environment variable interpolation in config files this means you will need to use$$$$to have a literal$in the replacement pattern.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The original string. | yes | |
| pattern | regex string | Replace all matches of this pattern. Can be a static string or a regular expression. | yes | |
| with | string | The string that the matches are replaced with. | yes | |
| count | integer | The maximum number of replacements to perform. -1 means replace all matches. | -1 | no |
replace_with
infallible pureReplaces all matching instances of pattern using a closure.
The pattern argument accepts a regular expression that can use capture groups.
The function uses the function closure syntax to compute the replacement values.
The closure takes a single parameter, which is an array, where the first item is always
present and contains the entire string that matched pattern. The items from index one on
contain the capture groups of the corresponding index. If a capture group is optional, the
value may be null if it didn’t match.
The value returned by the closure must be a string and will replace the section of the input that was matched.
This returns a new string with the replacements, the original string is not mutated.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The original string. | yes | |
| pattern | regex | Replace all matches of this pattern. Must be a regular expression. | yes | |
| count | integer | The maximum number of replacements to perform. -1 means replace all matches. | -1 | no |
Examples
Capitalize words
replace_with("apples and bananas", r'\b(\w)(\w*)') -> |match| {
upcase!(match.captures[0]) + string!(match.captures[1])
}Apples And BananasReplace with hash
replace_with("email from test@example.com", r'\w+@example.com') -> |match| {
sha2(match.string, variant: "SHA-512/224")
}email from adf6e1bc4415d24912bd93072ad34ef825a7b6eb3bf53f68def1fc17screamingsnakecase
infallible purevalue string, and turns it into SCREAMING_SNAKE case. Optionally, you can
pass in the existing case of the function, or else we will try to figure out the case automatically.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to convert to SCREAMING_SNAKE case. | yes | |
| original_case | string | Optional hint on the original case type. Must be one of: kebab-case, camelCase, PascalCase, SCREAMING_SNAKE, snake_case | no |
sieve
infallible pureKeeps only matches of pattern in value.
This can be used to define patterns that are allowed in the string and remove everything else.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The original string. | yes | |
| pattern | regex | Keep all matches of this pattern. | yes | |
| replace_single | string | The string to use to replace single rejected characters. | no | |
| replace_repeated | string | The string to use to replace multiple sequential instances of rejected characters. | no |
slice
infallible pureReturns a slice of value between the start and end positions.
If the start and end parameters are negative, they refer to positions counting from the right of the
string or array. If end refers to a position that is greater than the length of the string or array,
a slice up to the end of the string or array is returned.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | array string | The string or array to slice. | yes | |
| start | integer | The inclusive start position. A zero-based index that can be negative. | yes | |
| end | integer | The exclusive end position. A zero-based index that can be negative. | String length | no |
snakecase
infallible purevalue string, and turns it into snake-case. Optionally, you can
pass in the existing case of the function, or else we will try to figure out the case automatically.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to convert to snake-case. | yes | |
| original_case | string | Optional hint on the original case type. Must be one of: kebab-case, camelCase, PascalCase, SCREAMING_SNAKE, snake_case | no |
split
infallible purevalue string using pattern.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to split. | yes | |
| pattern | string regex | The string is split whenever this pattern is matched. | yes | |
| limit | integer | The maximum number of substrings to return. | no |
starts_with
infallible purevalue begins with substring.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to search. | yes | |
| substring | string | The substring that the value must start with. | yes | |
| case_sensitive | boolean | Whether the match should be case sensitive. | true | no |
strip_ansi_escape_codes
infallible purevalue.strip_whitespace
infallible purevalue, where whitespace is defined by the Unicode
White_Space property.truncate
infallible purevalue string up to the limit number of characters.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | string | The string to truncate. | yes | |
| limit | integer float | The number of characters to truncate the string after. | yes | |
| ellipsis | boolean | This argument is deprecated. An ellipsis (...) is appended if the parameter is set to true and the value string
is truncated because it exceeded the limit. | no | |
| suffix | string | A custom suffix (...) is appended to truncated strings.
If ellipsis is set to true, this parameter is ignored for backwards compatibility. | no |
upcase
infallible purevalue, where upcase is defined according to the Unicode Derived Core Property
Uppercase.System functions
get_env_var
fallible purename.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| name | string | The name of the environment variable. | yes |
Errors
Theget_env_var function is fallible, which means that
error handling is required for these errors:name does not exist.name is not valid Unicodeget_hostname
fallible pureFunction spec
Errors
Theget_hostname function is fallible, which means that
error handling is required for these errors:get_timezone_name
fallible purelocal, then it attempts to
determine the name of the timezone from the host OS. If this
is not possible, then it returns the fixed offset of the
local timezone for the current time in the format "[+-]HH:MM",
for example, "+02:00".Function spec
Errors
Theget_timezone_name function is fallible, which means that
error handling is required for these errors:Timestamp functions
format_timestamp
infallible purevalue into a string representation of the timestamp.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | timestamp | The timestamp to format as text. | yes | |
| format | string | The format string as described by the Chrono library. | yes | |
| timezone | string | The timezone to use when formatting the timestamp. The parameter uses the TZ identifier or local. | no |
now
infallible pureType functions
array
fallible purevalue if it is an array, otherwise returns an error. This enables the type checker to guarantee that the
returned value is an array and can be used in any function that expects an array.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | any | The value to check if it is an array. | yes |
Errors
Thearray function is fallible, which means that
error handling is required for these errors:value is not an array.bool
fallible purevalue if it is a Boolean, otherwise returns an error. This enables the type checker to guarantee that the
returned value is a Boolean and can be used in any function that expects a Boolean.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | any | The value to check if it is a Boolean. | yes |
Errors
Thebool function is fallible, which means that
error handling is required for these errors:value is not a Boolean.float
fallible purevalue if it is a float, otherwise returns an error. This enables the type checker to guarantee that the
returned value is a float and can be used in any function that expects a float.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | any | The value to check if it is a float. | yes |
Errors
Thefloat function is fallible, which means that
error handling is required for these errors:value is not a float.int
fallible purevalue if it is an integer, otherwise returns an error. This enables the type checker to guarantee that the
returned value is an integer and can be used in any function that expects an integer.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | any | The value to check if it is an integer. | yes |
Errors
Theint function is fallible, which means that
error handling is required for these errors:value is not an integer.is_array
infallible purevalue’s type is an array.is_boolean
infallible purevalue’s type is a boolean.is_empty
infallible pure0.is_float
infallible purevalue’s type is a float.is_integer
infallible pureis_json
infallible pureis_nullish
infallible purevalue is nullish, where nullish denotes the absence of a
meaningful value.is_object
infallible purevalue’s type is an object.is_regex
infallible purevalue’s type is a regex.is_string
infallible purevalue’s type is a string.is_timestamp
infallible purevalue’s type is a timestamp.object
fallible purevalue if it is an object, otherwise returns an error. This enables the type checker to guarantee that the
returned value is an object and can be used in any function that expects an object.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | any | The value to check if it is an object. | yes |
Errors
Theobject function is fallible, which means that
error handling is required for these errors:value is not an object.string
fallible purevalue if it is a string, otherwise returns an error. This enables the type checker to guarantee that the
returned value is a string and can be used in any function that expects a string.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | any | The value to check if it is a string. | yes |
Errors
Thestring function is fallible, which means that
error handling is required for these errors:value is not a string.tag_types_externally
infallible pureAdds type information to all (nested) scalar values in the provided value.
The type information is added externally, meaning that value has the form of "type": value after this
transformation.
Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | any | The value to tag with types. | yes |
Examples
Tag types externally (object)
tag_types_externally({
"message": "Hello world",
"request": {
"duration_ms": 67.9
}
}){
"message": {
"string": "Hello world"
},
"request": {
"duration_ms": {
"float": 67.9
}
}
}timestamp
fallible purevalue if it is a timestamp, otherwise returns an error. This enables the type checker to guarantee that
the returned value is a timestamp and can be used in any function that expects a timestamp.Function spec
| Argument | Type | Description | Default | Required? |
|---|---|---|---|---|
| value | any | The value to check if it is a timestamp. | yes |
Errors
Thetimestamp function is fallible, which means that
error handling is required for these errors:value is not a timestamp.