Gawk!

GNU Wget is a free software http://gnu.org/philosophy/free-sw
package for retrieving files using HTTP, HTTPS and FTP, the most widely-used Internet protocols. It is a non-interactive commandline tool, so it may easily be called from scripts, `cron` jobs, terminals without X-Windows support, etc. GNU Wget has many features to make retrieving large files or mirroring entire web or FTP sites easy, including: * Can resume aborted downloads, using `REST` and `RANGE` * Can use filename wild cards and recursively mirror directories * NLS-based message files for many different languages * Optionally converts absolute links in downloaded documents to relative, so that downloaded documents may link to each other locally * Runs on most UNIX-like operating systems as well as Microsoft

Gawk alternatives

  • ripgrep

  • ripgrep is a command line search tool that combines the usability of The Silver Searcher (an ack clone) with the raw speed of GNU grep.

    tags: search-tool grep
  • datamash

  • GNU datamash is command-line program which performs simple calculation (e.g. count, sum, min, max, mean, stdev, string coalescing) on input files. Datamash has a rich set of statistical functions, to quickly assess information in textual input files. An example of calculating basic statistic (mean, 1st quartile, median, 3rd quarile, IQR, sample-standard-deviation, and p-value of Jarque-Bera test for normal distribution:

    tags: data-analysis command-line-interface statistical-analysis text-analysis
  • Mawk

  • mawk is an interpreter for the AWK Programming Language.

    tags: awk interpreter programming-language