refer(1)                                             General Commands Manual                                             refer(1)

Name
       refer - process bibliographic references for groff

Synopsis
       refer [-bCenPRS] [-a n] [-B field.macro] [-c fields] [-f n] [-i fields] [-k field] [-l range-expression] [-p database-
             file] [-s fields] [-t n] [file ...]

       refer --help

       refer -v
       refer --version

Description
       The GNU implementation of refer is part of the groff(1) document formatting system.  refer is a troff(1) preprocessor that
       prepares  bibilographic citations by looking up keywords specified in a roff(7) input document, obviating the need to type
       such annotations, and permitting the citation style in formatted output to be altered  independently  and  systematically.
       It  copies  the  contents of each file to the standard output stream, except that it interprets lines between .[ and .] as
       citations to be translated into groff input, and lines between .R1 and .R2 as instructions regarding how citations are  to
       be  processed.  Normally, refer is not executed directly by the user, but invoked by specifying the -R option to groff(1).
       If no file operands are given on the command line, or if file is “-”, the standard input stream is read.

       Each citation specifies a reference.  The citation can specify a reference that is contained in a  bibliographic  database
       by  giving  a  set of keywords that only that reference contains.  Alternatively it can specify a reference by supplying a
       database record in the citation.  A combination of these alternatives is also possible.

       For each citation, refer can produce a mark in the text.  This mark consists of some label which can be separated from the
       text and from other labels in various ways.  For each reference it also outputs groff(7) language  commands  that  can  be
       used  by  a  macro  package  to  produce  a  formatted reference for each citation.  The output of refer must therefore be
       processed using a suitable macro package, such as me, mm, mom, or ms.  The commands to format a citation's  reference  can
       be  output  immediately  after  the  citation, or the references may be accumulated, and the commands output at some later
       point.  If the references are accumulated, then multiple citations of the same reference will produce a  single  formatted
       reference.

       The interpretation of lines between .R1 and .R2 as prepreocessor commands is a feature of GNU refer.  Documents making use
       of this feature can still be processed by AT&T refer just by adding the lines
              .de R1
              .ig R2
              ..
       to  the beginning of the document.  This will cause troff(1) to ignore everything between .R1 and .R2.  The effect of some
       commands can also be achieved by options.  These options are supported mainly for compatibility with AT&T  refer.   It  is
       usually more convenient to use commands.

       refer  generates  .lf requests so that file names and line numbers in messages produced by commands that read refer output
       will be correct; it also interprets lines beginning with .lf so that file names and line numbers in the messages  and  .lf
       lines that it produces will be accurate even if the input has been preprocessed by a command such as soelim(1).

   Bibliographic databases
       The  bibliographic database is a text file consisting of records separated by one or more blank lines.  Within each record
       fields start with a % at the beginning of a line.  Each field has a one character name that immediately follows the %.  It
       is best to use only upper and lower case letters for the names of fields.  The name of the field should be followed by ex‐
       actly one space, and then by the contents of the field.  Empty fields are ignored.  The conventional meaning of each field
       is as follows:

       %A     The name of an author.  If the name contains a suffix such as “Jr.”, it should be separated from the last name by a
              comma.  There can be multiple occurrences of the %A field.  The order is significant.  It is a good idea always  to
              supply an %A field or a %Q field.

       %B     For an article that is part of a book, the title of the book.

       %C     The place (city) of publication.

       %D     The  date  of  publication.  The year should be specified in full.  If the month is specified, the name rather than
              the number of the month should be used, but only the first three letters are required.  It is a good idea always to
              supply a %D field; if the date is unknown, a value such as in press or unknown can be used.

       %E     For an article that is part of a book, the name of an editor of the book.  Where the work has editors  and  no  au‐
              thors,  the names of the editors should be given as %A fields and “, (ed.)” or “, (eds.)” should be appended to the
              last author.

       %G     U.S. government ordering number.

       %I     The publisher (issuer).

       %J     For an article in a journal, the name of the journal.

       %K     Keywords to be used for searching.

       %L     Label.

       %N     Journal issue number.

       %O     Other information.  This is usually printed at the end of the reference.

       %P     Page number.  A range of pages can be specified as m-n.

       %Q     The name of the author, if the author is not a person.  This will only be used if there are no  %A  fields.   There
              can only be one %Q field.

       %R     Technical report number.

       %S     Series name.

       %T     Title.  For an article in a book or journal, this should be the title of the article.

       %V     Volume number of the journal or book.

       %X     Annotation.

       For  all  fields  except  %A and %E, if there is more than one occurrence of a particular field in a record, only the last
       such field will be used.

       If accent strings are used, they should follow the character to be accented.  This means that an ms document must call the
       .AM macro when it initializes.  Accent strings should not be quoted: use one \ rather than two.  Accent strings are an ob‐
       solescent feature of the me and ms macro packages; modern documents should use groff special  character  escape  sequences
       instead; see groff_char(7).

   Citations
       Citations have a characteristic format.
              .[opening-text
              flags keywords
              fields
              .]closing-text

       The opening-text, closing-text, and flags components are optional.  Only one of the keywords and fields components need be
       specified.

       The keywords component says to search the bibliographic databases for a reference that contains all the words in keywords.
       It is an error if more than one reference is found.

       The  fields components specifies additional fields to replace or supplement those specified in the reference.  When refer‐
       ences are being accumulated and the keywords component is non-empty, then additional fields should be  specified  only  on
       the first occasion that a particular reference is cited, and will apply to all citations of that reference.

       The  opening-text  and  closing-text  components  specify  strings to be used to bracket the label instead of those in the
       bracket-label command.  If either of these components is non-empty, the strings specified  in  the  bracket-label  command
       will  not  be used; this behavior can be altered using the [ and ] flags.  Leading and trailing spaces are significant for
       these components.

       The flags component is a list of non-alphanumeric characters each of which modifies the treatment of this particular cita‐
       tion.  AT&T refer will treat these flags as part of the keywords and so will ignore them since they are  non-alphanumeric.
       The following flags are currently recognized.

       #      Use  the  label  specified by the short-label command, instead of that specified by the label command.  If no short
              label has been specified, the normal label will be used.  Typically the short label is used with author-date labels
              and consists of only the date and possibly a disambiguating letter; the “#” is supposed to be suggestive of  a  nu‐
              meric type of label.

       [      Precede opening-text with the first string specified in the bracket-label command.

       ]      Follow closing-text with the second string specified in the bracket-label command.

       An  advantage  of  using the [ and ] flags rather than including the brackets in opening-text and closing-text is that you
       can change the style of bracket used in the document just by changing the bracket-label command.  Another is that  sorting
       and merging of citations will not necessarily be inhibited if the flags are used.

       If  a  label  is to be inserted into the text, it will be attached to the line preceding the .[ line.  If there is no such
       line, then an extra line will be inserted before the .[ line and a warning will be given.

       There is no special notation for making a citation to multiple references.  Just use a sequence of citations, one for each
       reference.  Don't put anything between the citations.  The labels for all the citations will be attached to the line  pre‐
       ceding  the first citation.  The labels may also be sorted or merged.  See the description of the <> label expression, and
       of the sort-adjacent-labels and abbreviate-label-ranges commands.  A label will not be merged if its citation has  a  non-
       empty  opening-text or closing-text.  However, the labels for a citation using the ] flag and without any closing-text im‐
       mediately followed by a citation using the [ flag and without any opening-text may be sorted and merged  even  though  the
       first  citation's  opening-text or the second citation's closing-text is non-empty.  (If you wish to prevent this, use the
       dummy character escape sequence \& as the first citation's closing-text.)

   Commands
       Commands are contained between lines starting with .R1 and .R2.  Recognition of these lines can be prevented by the -R op‐
       tion.  When a .R1 line is recognized any accumulated references are flushed out.  Neither .R1 nor .R2 lines, nor  anything
       between them, is output.

       Commands  are  separated by newlines or semicolons.  A number sign (#) introduces a comment that extends to the end of the
       line, but does not conceal the newline.  Each command is broken up into words.  Words are separated by spaces or tabs.   A
       word that begins with a (neutral) double quote (") extends to the next double quote that is not followed by another double
       quote.   If  there is no such double quote, the word extends to the end of the line.  Pairs of double quotes in a word be‐
       ginning with a double quote collapse to one double quote.  Neither a number sign nor a semicolon is recognized inside dou‐
       ble quotes.  A line can be continued by ending it with a backslash “\”; this works everywhere except after a number sign.

       Each command name that is marked with * has an associated negative command no-name that undoes the effect  of  name.   For
       example, the no-sort command specifies that references should not be sorted.  The negative commands take no arguments.

       In  the  following  description each argument must be a single word; field is used for a single upper or lower case letter
       naming a field; fields is used for a sequence of such letters; m and n are used for a non-negative numbers; string is used
       for an arbitrary string; file is used for the name of a file.

       abbreviate* fields string1 string2 string3 string4
              Abbreviate the first names of fields.  An initial letter will be separated from another initial letter by  string1,
              from  the  last name by string2, and from anything else (such as “von” or “de”) by string3.  These default to a pe‐
              riod followed by a space.  In a hyphenated first name, the initial of the first part of the name will be  separated
              from the hyphen by string4; this defaults to a period.  No attempt is made to handle any ambiguities that might re‐
              sult from abbreviation.  Names are abbreviated before sorting and before label construction.

       abbreviate-label-ranges* string
              Three or more adjacent labels that refer to consecutive references will be abbreviated to a label consisting of the
              first label, followed by string, followed by the last label.  This is mainly useful with numeric labels.  If string
              is omitted, it defaults to “-”.

       accumulate*
              Accumulate  references  instead of writing out each reference as it is encountered.  Accumulated references will be
              written out whenever a reference of the form
                     .[
                     $LIST$
                     .]
              is encountered, after all input files have been processed, and whenever a .R1 line is recognized.

       annotate* field string
              field is an annotation; print it at the end of the reference as a paragraph preceded by the line

                     .string

              If string is omitted, it will default to AP; if field is also omitted it will default to X.  Only one field can  be
              an annotation.

       articles string ...
              Each  string  is a definite or indefinite article, and should be ignored at the beginning of T fields when sorting.
              Initially, “a”, “an”, and “the” are recognized as articles.

       bibliography file ...
              Write out all the references contained in each bibliographic database file.  This command should come  last  in  an
              .R1/.R2 block.

       bracket-label string1 string2 string3
              In the text, bracket each label with string1 and string2.  An occurrence of string2 immediately followed by string1
              will be turned into string3.  The default behavior is as follows.
                     bracket-label \*([. \*(.] ", "

       capitalize fields
              Convert fields to caps and small caps.

       compatible*
              Recognize .R1 and .R2 even when followed by a character other than space or newline.

       database file ...
              Search  each  bibliographic database file.  For each file, if an index file.i created by indxbib(1) exists, then it
              will be searched instead; each index can cover multiple databases.

       date-as-label* string
              string is a label expression that specifies a string with which to replace the D field after constructing  the  la‐
              bel.   See  subsection “Label expressions” below for a description of label expressions.  This command is useful if
              you do not want explicit labels in the reference list, but instead want to handle any necessary  disambiguation  by
              qualifying  the date in some way.  The label used in the text would typically be some combination of the author and
              date.  In most cases you should also use the no-label-in-reference command.  For example,
                     date-as-label D.+yD.y%a*D.-y
              would attach a disambiguating letter to the year part of the D field in the reference.

       default-database*
              The default database should be searched.  This is the default behavior, so the negative version of this command  is
              more  useful.  refer determines whether the default database should be searched on the first occasion that it needs
              to do a search.  Thus a no-default-database command must be given before then, in order to be effective.

       discard* fields
              When the reference is read, fields should be discarded; no string definitions for  fields  will  be  output.   Ini‐
              tially, fields are XYZ.

       et-al* string m n
              Control  use of et al. in the evaluation of @ expressions in label expressions.  If the number of authors needed to
              make the author sequence unambiguous is u and the total number of authors is t then the last t-u  authors  will  be
              replaced  by string provided that t-u is not less than m and t is not less than n.  The default behavior is as fol‐
              lows.
                     et-al " et al" 2 3
              Note the absence of a dot from the end of the abbreviation, which is arguably not correct.  (Et al[.]  is short for
              et alli, as etc. is short for et cetera.)

       include file
              Include file and interpret the contents as commands.

       join-authors string1 string2 string3
              Join multiple authors together with strings.  When there are exactly two authors, they will be joined with string1.
              When there are more than two authors, all but the last two will be joined with string2, and the  last  two  authors
              will be joined with string3.  If string3 is omitted, it will default to string1; if string2 is also omitted it will
              also default to string1.  For example,
                     join-authors " and " ", " ", and "
              will restore the default method for joining authors.

       label-in-reference*
              When  outputting the reference, define the string [F to be the reference's label.  This is the default behavior, so
              the negative version of this command is more useful.

       label-in-text*
              For each reference output a label in the text.  The label will be separated from the surrounding text as  described
              in  the  bracket-label command.  This is the default behavior, so the negative version of this command is more use‐
              ful.

       label string
              string is a label expression describing how to label each reference.

       separate-label-second-parts string
              When merging two-part labels, separate the second part of the second label from the first label with  string.   See
              the description of the <> label expression.

       move-punctuation*
              In  the  text, move any punctuation at the end of line past the label.  It is usually a good idea to give this com‐
              mand unless you are using superscripted numbers as labels.

       reverse* string
              Reverse the fields whose names are in string.  Each field name can be followed by a number which says how many such
              fields should be reversed.  If no number is given for a field, all such fields will be reversed.

       search-ignore* fields
              While searching for keys in databases for which no index exists, ignore the contents of fields.  Initially,  fields
              XYZ are ignored.

       search-truncate* n
              Only  require  the  first  n characters of keys to be given.  In effect when searching for a given key words in the
              database are truncated to the maximum of n and the length of the key.  Initially, n is 6.

       short-label* string
              string is a label expression that specifies an alternative (usually shorter) style of label.  This is used when the
              # flag is given in the citation.  When using author-date style labels, the identity of the  author  or  authors  is
              sometimes  clear  from  the  context, and so it may be desirable to omit the author or authors from the label.  The
              short-label command will typically be used to specify a label containing just a date and possibly a  disambiguating
              letter.

       sort* string
              Sort  references  according  to  string.  References will automatically be accumulated.  string should be a list of
              field names, each followed by a number, indicating how many fields with the name should be used for  sorting.   “+”
              can  be used to indicate that all the fields with the name should be used.  Also . can be used to indicate the ref‐
              erences should be sorted using the (tentative) label.  (Subsection “Label expressions” below describes the  concept
              of a tentative label.)

       sort-adjacent-labels*
              Sort  labels  that are adjacent in the text according to their position in the reference list.  This command should
              usually be given if the abbreviate-label-ranges command has been given, or if the label expression  contains  a  <>
              expression.  This will have no effect unless references are being accumulated.

   Label expressions
       Label  expressions  can  be  evaluated both normally and tentatively.  The result of normal evaluation is used for output.
       The result of tentative evaluation, called the tentative label, is used to gather the information that  normal  evaluation
       needs to disambiguate the label.  Label expressions specified by the date-as-label and short-label commands are not evalu‐
       ated tentatively.  Normal and tentative evaluation are the same for all types of expression other than @, *, and % expres‐
       sions.  The description below applies to normal evaluation, except where otherwise specified.

       field
       field n
              The n-th part of field.  If n is omitted, it defaults to 1.

       'string'
              The characters in string literally.

       @      All  the  authors  joined  as specified by the join-authors command.  The whole of each author's name will be used.
              However, if the references are sorted by author (that is, the sort specification starts with “A+”),  then  authors'
              last  names  will be used instead, provided that this does not introduce ambiguity, and also an initial subsequence
              of the authors may be used instead of all the authors, again provided that this does not introduce ambiguity.   The
              use  of  only  the  last  name for the i-th author of some reference is considered to be ambiguous if there is some
              other reference, such that the first i-1 authors of the references are the same, the i-th authors are not the same,
              but the i-th authors last names are the same.  A proper initial subsequence of the sequence  of  authors  for  some
              reference  is considered to be ambiguous if there is a reference with some other sequence of authors which also has
              that subsequence as a proper initial subsequence.  When an initial subsequence of authors is  used,  the  remaining
              authors  are  replaced  by  the string specified by the et-al command; this command may also specify additional re‐
              quirements that must be met before an initial subsequence can be used.  @ tentatively evaluates to a canonical rep‐
              resentation of the authors, such that authors that compare equally for sorting purpose will have the same represen‐
              tation.

       %n
       %a
       %A
       %i
       %I     The serial number of the reference formatted according to the character following the %.  The serial  number  of  a
              reference  is  1  plus the number of earlier references with same tentative label as this reference.  These expres‐
              sions tentatively evaluate to an empty string.

       expr*  If there is another reference with the same tentative label as  this  reference,  then  expr,  otherwise  an  empty
              string.  It tentatively evaluates to an empty string.

       expr+n
       expr-n The  first (+) or last (-) n upper or lower case letters or digits of expr.  roff special characters (such as \('a)
              count as a single letter.  Accent strings are retained but do not count towards the total.

       expr.l expr converted to lowercase.

       expr.u expr converted to uppercase.

       expr.c expr converted to caps and small caps.

       expr.r expr reversed so that the last name is first.

       expr.a expr with first names abbreviated.  Fields specified in the abbreviate command are abbreviated  before  any  labels
              are evaluated.  Thus .a is useful only when you want a field to be abbreviated in a label but not in a reference.

       expr.y The year part of expr.

       expr.+y
              The part of expr before the year, or the whole of expr if it does not contain a year.

       expr.-y
              The part of expr after the year, or an empty string if expr does not contain a year.

       expr.n The last name part of expr.

       expr1~expr2
              expr1 except that if the last character of expr1 is - then it will be replaced by expr2.

       expr1 expr2
              The concatenation of expr1 and expr2.

       expr1|expr2
              If expr1 is non-empty then expr1 otherwise expr2.

       expr1&expr2
              If expr1 is non-empty then expr2 otherwise an empty string.

       expr1?expr2:expr3
              If expr1 is non-empty then expr2 otherwise expr3.

       <expr> The  label  is  in  two parts, which are separated by expr.  Two adjacent two-part labels which have the same first
              part will be merged by appending the second part of the second label onto the first label separated by  the  string
              specified  in the separate-label-second-parts command (initially, a comma followed by a space); the resulting label
              will also be a two-part label with the same first part as before merging, and so additional labels  can  be  merged
              into  it.   It  is  permissible  for  the first part to be empty; this may be desirable for expressions used in the
              short-label command.

       (expr) The same as expr.  Used for grouping.

       The above expressions are listed in order of precedence (highest first); & and | have the same precedence.

   Macro interface
       Each reference starts with a call to the macro ]-.  The string [F will be defined to be the label for this reference,  un‐
       less  the  no-label-in-reference  command has been given.  There then follows a series of string definitions, one for each
       field: string [X corresponds to field X.  The register [P is set to 1 if the P field contains a range of pages.   The  [T,
       [A  and  [O  registers are set to 1 according as the T, A and O fields end with any of .?! (an end-of-sentence character).
       The [E register will be set to 1 if the [E string contains more than one name.  The reference is followed by a call to the
       ][ macro.  The first argument to this macro gives a number representing the type of the reference.  If  a  reference  con‐
       tains  a  J field, it will be classified as type 1, otherwise if it contains a B field, it will be type 3, otherwise if it
       contains a G or R field it will be type 4, otherwise if it contains an I field it will be type 2,  otherwise  it  will  be
       type  0.   The  second  argument  is  a  symbolic  name  for  the  type: other, journal-article, book, article-in-book, or
       tech-report.  Groups of references that have been accumulated or are produced by the bibliography command are preceded  by
       a call to the ]< macro and followed by a call to the ]> macro.

Options
       --help displays a usage message, while -v and --version show version information; all exit afterward.

       -R     Don't recognize lines beginning with .R1/.R2.

       Other options are equivalent to refer commands.

       -a n            reverse An

       -b              no-label-in-text; no-label-in-reference

       -B              See below.

       -c fields       capitalize fields

       -C              compatible

       -e              accumulate

       -f n            label %n

       -i fields       search-ignore fields

       -k              label L~%a

       -k field        label field~%a

       -l              label A.nD.y%a

       -l m            label A.n+mD.y%a

       -l ,n           label A.nD.y-n%a

       -l m,n          label A.n+mD.y-n%a

       -n              no-default-database

       -p db-file      database db-file

       -P              move-punctuation

       -s spec         sort spec

       -S              label "(A.n|Q) ', ' (D.y|D)"; bracket-label " (" ) "; "

       -t n            search-truncate n

       The  B option has command equivalents with the addition that the file names specified on the command line are processed as
       if they were arguments to the bibliography command instead of in the normal way.

       -B              annotate X AP; no-label-in-reference

       -B field.macro  annotate field macro; no-label-in-reference

Environment
       REFER  If set, overrides the default database.

Files
       /usr/dict/papers/Ind
              Default database.

       file.i Index files.

       /usr/share/groff/1.23.0/tmac/refer.tmac
              defines macros and strings facilitating integration with macro packages that wish to support refer.

       refer uses temporary files.  See the groff(1) man page for details of where such files are created.

Bugs
       In label expressions, <> expressions are ignored inside .char expressions.

Examples
       We can illustrate the operation of refer with a sample bibliographic database containing one entry and a simple roff docu‐
       ment to cite that entry.

              $ cat > my-db-file
              %A Daniel P.\& Friedman
              %A Matthias Felleisen
              %C Cambridge, Massachusetts
              %D 1996
              %I The MIT Press
              %T The Little Schemer, Fourth Edition
              $ refer -p my-db-file
              Read the book
              .[
              friedman
              .]
              on your summer vacation.
              <Control+D>
              .lf 1 -
              Read the book\*([.1\*(.]
              .ds [F 1
              .]-
              .ds [A Daniel P. Friedman and Matthias Felleisen
              .ds [C Cambridge, Massachusetts
              .ds [D 1996
              .ds [I The MIT Press
              .ds [T The Little Schemer, Fourth Edition
              .nr [T 0
              .nr [A 0
              .][ 2 book
              .lf 5 -
              on your summer vacation.

       The foregoing shows us that refer (a) produces a label “1”; (b) brackets that label with interpolations of the  “[.”   and
       “.]”   strings;  (c) calls a macro “]-”; (d) defines strings and registers containing the label and bibliographic data for
       the reference; (e) calls a macro “][”; and (f) uses the lf request to restore the line numbers of the original input.   As
       discussed in subsection “Macro interface” above, it is up to the document or a macro package to employ and format this in‐
       formation usefully.  Let us see how we might turn groff_ms(7) to this task.

              $ REFER=my-db-file groff -R -ms
              .LP
              Read the book
              .[
              friedman
              .]
              on your summer vacation.
              Commentary is available.\*{*\*}
              .FS \*{*\*}
              Space reserved for penetrating insight.
              .FE

       ms's  automatic footnote numbering mechanism is not aware of refer's label numbering, so we have manually specified a (su‐
       perscripted) symbolic footnote for our non-bibliographic aside.

See also
       “Some Applications of Inverted Indexes on the Unix System”, by M. E. Lesk, 1978, AT&T Bell Laboratories Computing  Science
       Technical Report No. 69.

       indxbib(1), lookbib(1), lkbib(1)

groff 1.23.0                                              31 March 2024                                                  refer(1)