Linux sort命令简介

用sort对文件排序，发现这个命令比想象中要复杂和强大，仔细研究了一下文档，记录一下。

首先看一下文档，建议浏览一下，用的时候再详细看看：

$ sort --help用法：sort [选项]... [文件]...　或：sort [选项]... --files0-from=FWrite sorted concatenation of all FILE(s) to standard output.如果没有指定文件，或者文件为"-"，则从标准输入读取。必选参数对长短选项同时适用。排序选项：  -b, --ignore-leading-blanks  忽略前导的空白区域  -d, --dictionary-order  只考虑空白区域和字母字符  -f, --ignore-case    忽略字母大小写  -g, --general-numeric-sort  compare according to general numerical value  -i, --ignore-nonprinting    consider only printable characters  -M, --month-sort            compare (unknown) < 'JAN' < ... < 'DEC'  -h, --human-numeric-sort    使用易读性数字(例如： 2K 1G)  -n, --numeric-sort          compare according to string numerical value  -R, --random-sort           shuffle, but group identical keys.  See shuf(1)      --random-source=FILE    get random bytes from FILE  -r, --reverse               reverse the result of comparisons      --sort=WORD    按照WORD 指定的格式排序：          一般数字-g，高可读性-h，月份-M，数字-n，          随机-R，版本-V  -V, --version-sort    在文本内进行自然版本排序其他选项：      --batch-size=NMERGE  一次最多合并NMERGE 个输入；如果输入更多          则使用临时文件  -c, --check, --check=diagnose-first  检查输入是否已排序，若已有序则不进行操作  -C, --check=quiet, --check=silent  类似-c，但不报告第一个无序行      --compress-program=程序  使用指定程序压缩临时文件；使用该程序          的-d 参数解压缩文件      --debug      为用于排序的行添加注释，并将有可能有问题的          用法输出到标准错误输出      --files0-from=文件  从指定文件读取以NUL 终止的名称，如果该文件被          指定为"-"则从标准输入读文件名  -k, --key=KEYDEF          sort via a key; KEYDEF gives location and type  -m, --merge               merge already sorted files; do not sort  -o, --output=文件    将结果写入到文件而非标准输出  -s, --stable      禁用last-resort 比较以稳定比较算法  -S, --buffer-size=大小  指定主内存缓存大小  -t, --field-separator=分隔符  使用指定的分隔符代替非空格到空格的转换  -T, --temporary-directory=目录  使用指定目录而非$TMPDIR 或/tmp 作为          临时目录，可用多个选项指定多个目录      --parallel=N    将同时运行的排序数改变为N  -u, --unique    配合-c，严格校验排序；不配合-c，则只输出一次排序结果  -z, --zero-terminated     line delimiter is NUL, not newline      --help    显示此帮助信息并退出      --version    显示版本信息并退出KEYDEF is F[.C][OPTS][,F[.C][OPTS]] for start and stop position, where F is afield number and C a character position in the field; both are origin 1, andthe stop position defaults to the line's end.  If neither -t nor -b is ineffect, characters in a field are counted from the beginning of the precedingwhitespace.  OPTS is one or more single-letter ordering options [bdfgiMhnRrV],which override global ordering options for that key.  If no key is given, usethe entire line as the key.  Use --debug to diagnose incorrect key usage.SIZE may be followed by the following multiplicative suffixes:% 1% of memory, b 1, K 1024 (default), and so on for M, G, T, P, E, Z, Y.*** WARNING ***The locale specified by the environment affects sort order.Set LC_ALL=C to get the traditional sort order that usesnative byte values.GNU coreutils online help: <http://www.gnu.org/software/coreutils/>请向<http://translationproject.org/team/zh_CN.html> 报告sort 的翻译错误Full documentation at: <http://www.gnu.org/software/coreutils/sort>or available locally via: info '(coreutils) sort invocation'

它的最基本用法就是”sort -k2,2 file”，表示排序的key开始列是2，结束列是2，也就是按照第二列排序。

下面通过一个例子来逐步了解更加复杂的用法。假设我们要排序的文件为：