Back to commands

Cybersecurity Triage

Read-only

Count the Most Common User Agents

Traffic changed suddenly and you need a quick view of which client-reported user agents dominate the access log.

Command

awk -F'"' '{print $6}' ./fixtures/nginx/access.log | sort | uniq -c | sort -nr | head

Before you run this

System impact: Read-only. Low when scoped to the shown target.

When not to use it: Do not trust user agents as identity. They are client-supplied strings and can be spoofed or missing.

Expected output

A descending count of user-agent strings, useful as a lead but not as proof of who made the requests.

System impact

Read-only. Nothing changes. The command extracts the user-agent field and counts repeated values; user-agent strings may include unusual client data.

Recovery / rollback: no state is changed.

When to use it

Use this when investigating scraping, monitoring noise, broken clients, or sudden traffic-composition changes.

When not to use it

Do not trust user agents as identity. They are client-supplied strings and can be spoofed or missing.

Watch this command run

Command transcript

This sanitized transcript shows the commands and output shape without exposing host details.

demo@lab:~$

$ awk -F'"' '{print $6}' ./sample-files/nginx/access.log | head

Mozilla/5.0
Mozilla/5.0
Mozilla/5.0
ScannerBot/1.0
ScannerBot/1.0
ScannerBot/1.0
ScannerBot/1.0
ScannerBot/1.0
SyntheticAudit/0.1
SyntheticAudit/0.1

$ awk -F'"' '{print $6}' ./sample-files/nginx/access.log | sort | uniq -c | sort -nr | head

     11 Mozilla/5.0
      5 UptimeCheck/2.0
      5 ScannerBot/1.0
      2 curl/8
      2 SyntheticAudit/0.1

$ awk -F'"' '{print $6}' ./sample-files/nginx/access.log | sort | uniq -c | sort -nr | tail

     11 Mozilla/5.0
      5 UptimeCheck/2.0
      5 ScannerBot/1.0
      2 curl/8
      2 SyntheticAudit/0.1
View commands shown

These are the commands shown in the sanitized transcript.

Commands shown

  1. awk -F'"' '{print $6}' ./fixtures/nginx/access.log | head
  2. awk -F'"' '{print $6}' ./fixtures/nginx/access.log | sort | uniq -c | sort -nr | head
  3. awk -F'"' '{print $6}' ./fixtures/nginx/access.log | sort | uniq -c | sort -nr | tail

next steps

Related commands

Cybersecurity Triage Read-only

Find Common Admin Probe Paths

A site does not need WordPress to receive WordPress-looking probes.

awk '$7 ~ /(admin|login|wp-|phpmyadmin)/ {print $1, $7, $9}' ./fixtures/nginx/access.log | sort | uniq -c | sort -nr | head
Cybersecurity Triage Read-only

Find the IPs Creating the Most 4xx Noise

One address can turn a normal access log into a wall of failed requests.

awk '$9 ~ /^4/ {count[$1]++} END {for (ip in count) print count[ip], ip}' ./fixtures/nginx/access.log | sort -nr | head
Cybersecurity Triage Read-only

Spot Unusual HTTP Methods in Access Logs

Most site traffic is boring. The weird methods are worth a look.

awk '$6 !~ /^"(GET|POST|HEAD|OPTIONS)$/ {print $1, $6, $7, $9}' ./fixtures/nginx/access.log | sort | uniq -c | sort -nr
Cybersecurity Triage Read-only

Spot Request Bursts by Minute

Traffic spikes are easier to read when you bucket them by time.

awk '{minute=substr($4,2,17); count[minute]++} END {for (m in count) print count[m], m}' ./fixtures/nginx/access.log | sort -nr | head
Cybersecurity Triage Read-only

Find Paths Repeatedly Returning 404

One missing URL is normal. A repeated missing URL is a signal.

awk '$9==404 {count[$7]++} END {for (path in count) if (count[path] >= 3) print count[path], path}' ./fixtures/nginx/access.log | sort -nr | head
Study mapping

Use this as independent command practice: read the notes, predict the output, then compare it with the example before using a real shell.

  • lpic1:103-gnu-unix-commands
  • lpic1:110-security
  • lfcs:essential-commands
  • lfcs:security-hygiene
  • linuxplus:automation-scripting
  • linuxplus:provisional
  • risk:read-only

Useful for

  • LPIC-1 style command-line practice
  • LFCS style performance tasks
  • Linux+ style troubleshooting review

Independent study support only. No affiliation, endorsement, exam dumps, or real exam questions.