A variable summary file (or list file), which contains histogram information showing the variable's distribution in the data file, is created for each variable (or designated variables) in the specified data file. You can optionally specify an output file in which a summary of processing activity is saved.
Variable summaries (list files) can be helpful for performing quality
control checks of data. For example, you could run checkvar on
an ASCII file, convert the file to binary, and then run checkvar
on the binary file. The output from checkvar should be the same
for both the ASCII and binary files. You can also use variable
summaries to look at the data distribution in a data set before
extracting data.
The checkvar command has the following form:
checkvarinput_file [-fformat_file] [-ifinput_format_file] [-ofoutput_format_file][
-ft "title"] [-ift "title"] [-oft "title"] [-blocal_buffer_size] [-ccount] [-vvar_file] [-qquery_file] [-pprecision] [-mmaxbins] [-mdmissing_data_flag] [-mm] [-oprocessing_summary]
The checkvar program needs to find only an input format
description. Output format descriptions will be ignored. If conversion
variables are included in input or output formats, no conversion is
performed when you run checkvar, since it ignores output
formats.
For descriptions of the standard arguments (first eleven arguments above), see Section 8.6.
-p precisioncheckvar
output.
The default is 0; maximum is 5.
NOTE: If you use the -p option on the command line, the precision
set in the relevant format file is overridden. The precision in
the format file serves as the default.
-m maxbinscheckvar output. The checkvar program keeps
track of the number of bins filled as the data is processed. The
smaller the number of bins, the faster checkvar runs. By
keeping the number of bins small, you can check the gross aspects of
data distribution rather than the details.
The number of bins is adjusted dynamically as checkvar runs
depending on the distribution of data in the input file. If the
number of filled bins becomes > 1.5 * maxbins, the width of the bins
is doubled to keep the total number near the desired maximum.
The default is 100 bins; minimum is 6. Must be < 10,000.
NOTE: The precision (-p) and maxbins (-m) options have no effect on character variables.
-md missing_data_flagcheckvar should
ignore across all variables in creating histogram data. Missing data
flags are used in a data file to indicate missing or meaningless
data. If you want checkvar to ignore more than one value, use
the query (-q) option in conjunction with the variable file
(-v) option.
-mm-o processing_summary