Added tabgen + manual
This commit is contained in:
116
util/cmisc/tabgen.1
Normal file
116
util/cmisc/tabgen.1
Normal file
@@ -0,0 +1,116 @@
|
||||
.TH TABGEN 1ACK
|
||||
.ad
|
||||
.SH NAME
|
||||
tabgen \- table generator for C-programs
|
||||
.SH SYNOPSYS
|
||||
.B tabgen \fIarguments\fP
|
||||
.SH DESCRIPTION
|
||||
.I Tabgen
|
||||
is a handy tool for generating tables for C-programs from a compact
|
||||
description. The current version is only suitable for generating character
|
||||
tables. The output is produced on standard output.
|
||||
It works by maintaining an internal table of values, printing this table
|
||||
when this is requested.
|
||||
.PP
|
||||
Each argument given to
|
||||
.I tabgen
|
||||
is either a command or a description. Descriptions are discussed first.
|
||||
.PP
|
||||
A description consists of a value (a string), directly followed by a semicolon,
|
||||
directly followed by a list of indices for which the table to be generated
|
||||
has this value. This list of indices must be in a certain \fBinputformat\fP,
|
||||
characterized by a charactet.
|
||||
Currently, there is only one inputformat, "c". In this format, the indices
|
||||
are characters. There are two special characters: '\e' and '-'. The
|
||||
'\e' behaves like in a C-string, and the '-' describes a range, unless
|
||||
it starts the list of indices.
|
||||
.PP
|
||||
Some examples of descriptions:
|
||||
.nf
|
||||
STIDF:a-zA-Z_
|
||||
STSKIP:\er \et\e013\ef
|
||||
.fi
|
||||
.PP
|
||||
These descriptions have the effect that the internal table values for
|
||||
'a' through 'z', 'A' through 'Z', and '_' are set to STIDF, and that the
|
||||
internal table values for carriage-return, space, tab, vertical-tab, and
|
||||
form-feed are set to STSKIP.
|
||||
.PP
|
||||
A command is introduced by a special character. On the command line,
|
||||
a command is introduced by a '-'. The following commands are
|
||||
recognized:
|
||||
.IP I\fIchar\fP
|
||||
switch to a different input format. This command is only there for future
|
||||
extensions.
|
||||
.IP f\fIfile\fP
|
||||
read input from the file \fIfile\fP. In a file, each line is an argument
|
||||
to \fItabgen\fP. Each line is either a command or a description. In a file,
|
||||
commands are introduced by a '%'.
|
||||
.IP F\fIformat\fP
|
||||
Values are printed in a printf format. The default value for this format
|
||||
is \fB"%s,\en"\fP. This can be changed with this command.
|
||||
.IP T\fItext\fP
|
||||
Print \fItext\fP literally at this point.
|
||||
.IP p
|
||||
Print the table as it is built at this point.
|
||||
.IP C
|
||||
Clear the table. This sets all internal table values to 0.
|
||||
.IP i\fIstr\fP
|
||||
Initialize all internal table values to \fIstr\fP. if \fIstr\fP is not
|
||||
given, this command is equivalent to the C command.
|
||||
.IP S\fInum\fP
|
||||
Set the table size to \fInum\fP entries. The default size is 128.
|
||||
.IP H\fIfilename\fP
|
||||
Create tables which can be indexed by the full range of characters,
|
||||
rather than 0..127. As this depends on the implementation of 'char'
|
||||
in C (signed or unsigned), this also generates a file \fIfilename\fP,
|
||||
with a #define for the constant "CharOffset". The generated tables can
|
||||
be indexed by first adding "CharOffset" to the base of the table.
|
||||
If \fIfilename\fP is not given, "charoffset.h" is used.
|
||||
.SH "AN EXAMPLE"
|
||||
.PP
|
||||
The next example is a part of the \fItabgen\fP description of the
|
||||
character tables used by the lexical analyser of the ACK Modula-2 compiler.
|
||||
This description resides in a file called char.tab.
|
||||
.I
|
||||
Tabgen
|
||||
is called as follows:
|
||||
.nf
|
||||
tabgen -fchar.tab > char.c
|
||||
.fi
|
||||
.PP
|
||||
The description as given here generates 2 tables: one indicating a class
|
||||
according to which token a character can be a start of, and one indicating
|
||||
whether a character may occur in an identifier.
|
||||
.nf
|
||||
|
||||
% Comments are introduced with space or tab after the %
|
||||
%S129
|
||||
%F %s,
|
||||
% CHARACTER CLASSES
|
||||
%iSTGARB
|
||||
STSKIP: \et\e013\e014\e015
|
||||
STNL:\e012
|
||||
STSIMP:-#&()*+,/;=[]^{|}~
|
||||
STCOMP:.:<>
|
||||
STIDF:a-zA-Z
|
||||
STSTR:"'
|
||||
STNUM:0-9
|
||||
STEOI:\e200
|
||||
%T#include "class.h"
|
||||
% class.h contains #defines for STSKIP, STNL, ...
|
||||
%Tchar tkclass[] = {
|
||||
%p
|
||||
%T};
|
||||
% INIDF
|
||||
%C
|
||||
1:a-zA-Z0-9
|
||||
%Tchar inidf[] = {
|
||||
%F %s,
|
||||
%p
|
||||
%T};
|
||||
.fi
|
||||
.SH BUGS
|
||||
.PP
|
||||
.I Tabgen
|
||||
assumes that characters are 8 bits wide.
|
||||
Reference in New Issue
Block a user