Metadata
_sourceHost |
The host name of the Source. For local Sources, the host name is set when you configure the Source. For remote Collectors, this field uses the remote host's name. The _sourceHost metadata field is populated using a reverse DNS lookup; if the name cannot be resolved, _sourceHost is displayed as localhost. |
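A minimal example of filtering on this field (the hostname is hypothetical):
_sourceHost=prod-web-01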
|
_sourceName |
The name of the log file, determined by the path you entered when you configured the Source. |
_sourceName=/path/to/file/
_sourceName=*path*
_sourcename = "/var/log/tomcat/logs/foobar.log"
|
_sourceCategory |
This field is created when you enter text into the Source Category field at Source configuration time. Log categories can be somewhat complex, as many log files may belong to more than one logical category. |
|
_sourceCategory=*Application*
|
_sourceCategory=*Audit
|
_collector |
Returns results from the named Collector only. Entered when a Collector is installed and activated. |
_collector=public_cloud
|
_source |
Returns results from the named Source only. Entered when a Source is configured. |
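A minimal example (the Source name is hypothetical):
_source=HTTP_API_prod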
|
While _sourcename = *api.log works, _sourcename = "*api.log" will fail, because wildcards are not interpreted inside quoted strings.
List all categories: * | count by _sourceCategory | fields -_count
Input format
keyvalue |
For key-value pair (KVP) logs. The keyvalue operator allows you to get values from a log message by specifying the key paired with each value. |
| keyvalue "age"
| keyvalue infer "hairColor", "lastVisit"
| keyvalue regex "=(.*?)[,|}]" keys "serviceinfo.IP", "loggingcontext.region", "request.method" as ip, region, method
| keyvalue auto
|
csv |
The csv operator allows you to parse Comma Separated Values (CSV) formatted log entries. It uses a comma as the default delimiter. |
|
Parse comma delimited fields |
| csv _raw extract 1 as user, 2 as id, 3 as name
|
Parse a stream query and extract search terms |
"Starting stream query" | parse "query=[*], queryId" as query | csv query extract searchTerms, op1, op2, op3
|
Specify an escape, and quote character |
| csv fieldName escape='\', quote=''' extract A, B, _, _, E, F
|
JSON |
The JSON operator allows you to extract values from JSON input. Because JSON supports both nested keys and arrays that contain ordered sequences of values, the Sumo Logic JSON operator allows you to extract single top-level fields, multiple fields, nested keys and keys in arrays. |
|
Extracting a single top-level field |
_sourceCategory=stream RawOutputProcessor "\"message\"" | parse "explainJsonPlan.stream]*" as jsonobject | json field=jsonobject "sessionId" | fields -jsonobject
|
Extracting multiple fields |
_sourceCategory=stream RawOutputProcessor "\"message\"" | parse "explainJsonPlan.stream]*" as jsonobject | json field=jsonobject "sessionId", "customerId" | fields -jsonobject
|
Extracting a nested key |
* | json field=jsonobject "meta.type"
|
Finding values in a JSON array |
* | json field=jsonobject "baselineIntervals"
|
Refer to one specific entry in an array |
* | json field=jsonobject "baselineIntervals[1]"
|
Using the nodrop option |
* | json field=jsonobject "baselineIntervals[0]" nodrop
|
Note: The JSON operator also supports the nodrop option, which allows messages containing invalid JSON values to be displayed. |
|
Using wildcard (*) |
_sourceCategory=O365* | json "Actor[*].Type" as Actortype
|
json auto works by searching for JSON blobs beginning at the end of the message. Logs usually begin with a preamble, such as a timestamp, so this works in most cases; however, if content appears after the JSON blob, the extraction can fail. Placing the JSON blob at the end of the message is therefore recommended. |
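A minimal sketch of json auto (the source category is illustrative):
_sourceCategory=stream | json auto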
|
KVP: Key-Value Pairs. Logs formatted this way look something like this:
[2019-12-24 23:59:59.380 +1100] age=42 name="Rick Deckard" hairColor="brown" lastVisit="2018-04-19 13:00"
infer: Default mode. Uses an internal list of regular expressions to extract the value for each key.
regex: In Regular Expression mode, you must explicitly match keys and values based on a regex.
auto: Extracts up to N fields; N is 100 by default.
Conditions
if |
There are two forms of ternary expression you can use in Sumo Logic queries: one is constructed with the if operator, and the other uses the question mark (?) operator. Both evaluate a condition as either true or false, with a value assigned for each outcome; they are shorthand for an if-else condition. |
| if(status_code matches "5*", 1, 0) as server_error
|
| status_code matches "5*" ? 1 : 0 as server_error
|
in |
The in operator returns a Boolean value: true if the specified value is in the specified list, or false if it is not. |
| if (status_code in ("500", "501", "502", "503", "504", "505", "506", "401", "402", "403", "404"), "Error", "OK") as status_code_type
|
where |
The where operator must appear as a separate operator distinct from other operators, delimited by the pipe symbol ("|"). |
//We recommend placing inclusive filters before exclusive filters in query strings
|
| where status_code matches "4*"
|
| where !(status_code matches "2*")
|
isBlank |
The isBlank operator checks whether a string is blank. Specifically, it checks whether a character sequence is whitespace, empty (""), or null. It takes a single parameter and returns a Boolean value: true if the variable is indeed blank, or false if the variable contains a value other than whitespace, empty, or null. |
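A minimal sketch, mirroring the isEmpty example below (the field name is illustrative):
| if(isBlank(user_agent),1,0) as blank_agent_count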
|
isEmpty |
The isEmpty operator checks whether a string is empty. Specifically, it checks whether a character sequence is empty ("") or null. It takes a single parameter and returns a Boolean value: true if the variable is indeed empty, or false if the variable contains a value other than empty or null. |
| if(isEmpty(src_ip),1,0) as null_ip_counts
|
isNull |
The isNull operator takes a single parameter and returns a Boolean value: true if the variable is indeed null, or false if the variable contains a value other than null. |
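A minimal sketch (the field name is illustrative):
| where isNull(src_ip)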
|
Data extraction
parse(regex) |
Best for variable patterns. Also called the extract operator; enables users to extract more complex data from log lines using regular expressions. Can be used to extract nested fields. |
| parse "Content=*:" as content
|
Parsing an IP address |
| parse regex "(?<ip_address>\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3}) "
|
Using a non-capturing group to indicate an OR condition |
| parse regex "list 101 (?:accepted|denied) (?<protocol>.*?) "
|
parse(anchor) |
Best for predictable patterns. Also called parse anchor; parses strings according to specified start and stop anchors, and labels the results as fields for use in subsequent aggregation functions in the query, such as sorting and grouping. |
| parse "User=*:" as user
|
split |
The split operator allows you to split strings into multiple strings, and parse delimited log entries, such as space-delimited formats. |
_sourceCategory=colon | parse "*] *" as log_level, text | split text delim=':' extract 1 as user, 2 as account_id, 3 as session_id, 4 as result
|
fields |
The fields operator allows you to choose which fields are displayed in the results of a query. |
_sourceCategory=access_logs | parse using public/apache | fields method, status_code
|
limit |
The limit operator reduces the number of raw messages or aggregate results returned. |
| count by _sourceCategory | sort by _count | limit 5
|
matches |
The matches operator can be used to match a string to a pattern. |
| if (agent matches "*MSIE*","Internet Explorer","Other") as Browser | if (agent matches "*Firefox*","Firefox",Browser) as Browser
|
timeslice |
The timeslice operator segregates data by time period. |
| timeslice 1h | count by _timeslice
|
_sourcename=*tomcat* | timeslice by 5m | count by _timeslice
|
Output of last example: |
# | Time | _count
1 | 09/07/2017 11:25:00 AM +1000 | 9,234
2 | 09/07/2017 11:30:00 AM +1000 | 14,496
3 | 09/07/2017 11:35:00 AM +1000 | 15,988
4 | 09/07/2017 11:40:00 AM +1000 | 3,383
|
trace |
A trace operator acts as a highly sophisticated filter to connect the dots across different log messages. You can use any identifying value with a trace operator (such as a user ID, IP address, session ID, etc.) to retrieve a comprehensive set of activity associated with that original ID. |
| trace "ID=([0-9a-fA-F]{4})" "7F92"
|
About limit: Can be used in Dashboard Panels, but in the search it must be included after the first group-by phrase.
About timeslice: Timeslices greater than 1 day cannot be used in Dashboard Live mode.
About trace: Not supported in Live Dashboards or any continuous query.
Crunch numbers
count count_distinct count_frequent |
Used in conjunction with the group operator and a field name; only the word by is required. The count function is also an operator in its own right and can therefore be used with or without the word by. |
| count by url
| count_distinct(referrer) by status_code
_sourcename=*tomcat* | count_distinct(_sourceName) group by _sourceHost | sort by _count_distinct desc
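A minimal count_frequent sketch (the field name is illustrative):
| count_frequent status_code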
|
sum |
Sum adds the values of the numerical field being evaluated within the time range analyzed. |
| sum(bytes_received) group by _sourceHost
|
avg |
The averaging function (avg) calculates the average value of the numerical field being evaluated within the time range analyzed. |
| avg(request_received) by _timeslice
|
median |
To calculate the median value of a particular field, use the Percentile (pct) operator with a percentile argument of 50. |
| parse "value=*" as value | pct(value, 50) as median
|
outlier |
Given a series of time-stamped numerical values, the outlier operator identifies values in a sequence that seem unexpected; it can, for example, flag an alert or violation in a scheduled search. |
_sourceCategory=IIS/Access | parse regex "\d+-\d+-\d+ \d+:\d+:\d+ (?<server_ip>\S+) (?<method>\S+) (?<cs_uri_stem>/\S+?) \S+ \d+ (?<user>\S+) (?<client_ip>[\.\d]+) " | parse regex "\d+ \d+ \d+ (?<response_time>\d+)$" | timeslice 1m | max(response_time) as response_time by _timeslice | outlier response_time window=5,threshold=3,consecutive=2,direction=+-
|
sort |
The sort operator orders aggregated search results. The default sort order is descending. |
| count as page_hits by _sourceHost | sort by page_hits asc
|
top |
Use the top operator with the sort operator to reduce the number of sorted results returned. |
| top 5 _sourcecategory
|
min |
The minimum function (min) returns the smallest value of the numerical field being evaluated within the time range analyzed. |
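A minimal sketch (the field name is illustrative):
| min(response_time) by _sourceHost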
|
max |
The maximum function (max) returns the largest value of the numerical field being evaluated within the time range analyzed. |
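A minimal sketch (the field name is illustrative):
| max(response_time) by _sourceHost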
|
About count_frequent: You can use the count_frequent operator in Dashboard queries, but the number of results returned is limited to the top 100 most frequent results.
About top: Can be used in Dashboard Panels, but in the search it must be included after the first group-by phrase.
Geo lookup
Sumo Logic can match an extracted IP address to its geographical location on a map. To create the map, parse the IP addresses from your log files, then use the lookup operator to match the extracted addresses to the physical locations where they originated. |
| parse "remote_ip=*]" as remote_ip | lookup latitude, longitude, country_code, country_name, region, city, postal_code, area_code, metro_code from geo://default on ip = remote_ip | count by latitude, longitude, country_code, country_name, region, city, postal_code, area_code, metro_code | sort _count
|
logcompare
The logcompare operator allows you to compare two sets of logs: baseline (historical) and target (current). To run a LogCompare operation, you can use the LogCompare button on the Messages tab to generate a properly formatted query.
| logcompare timeshift -24h
|
About logcompare: Not supported in Dashboards.
logreduce
The LogReduce algorithm uses fuzzy logic to cluster messages together based on string and pattern similarity. Use the LogReduce button and operator to quickly assess activity patterns for things like a range of devices or traffic on a website. |
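A minimal sketch, run over any set of search results:
* | logreduce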
|
About logreduce: Not supported in Dashboards.
save
The save operator allows you to save the results of a query to the Sumo Logic file system, in a simple format, at a location you choose. Later, you can use the lookup operator to access the saved data.
| save /shared/lookups/daily_users
|
About save: Not supported in Dashboards.
Visualization
transpose |
Turn a list into a table in the Aggregates tab. |
transpose row [row fields] column [column fields]
|
_sourceCategory=Labs/Apache/Access | timeslice 5m | count by _timeslice, status_code | transpose row _timeslice column status_code
|
Created By
https://tme520.com