Locations¶
Locations
The location is a concept often used in Camlp5, bound to where errors
occur in the source. The basic type is “Ploc.t
” which is an
abstract type.
Definitions
Internally a location is a pair of source positions: the beginning and the end of an element in the source (file or interactive). A located element can be a character (the end is just the beginning plus one), a token, or a longer sequence generally corresponding to a grammar rule.
A position is a count of characters since the beginning of the file, starting at zero. When a couple of positions define a location, the first position is the position of the first character of the element, and the last position is the first character not part of the element. The location length is the difference between those two numbers. Notice that the position corresponds exactly to the character count in the streams of characters.
In the extensible grammars, a variable with the
specific name “loc
” is predefined in all semantic actions: it is
the location of the associated rule. Since the syntax tree
quotations generate nodes with “loc
” as location
part, this allow to generate grammars without having to consider
source locations.
It is possible to change the name “loc
” to another name, through
the parameter “-loc
” of the Camlp5 commands.
Remark: the reason why the type “location
” is abstract is that in
future versions, it may contain other informations, such as the
associated comments, the type (for expressions nodes), things like
that, without having to change the already written programs.
Building locations
Tools are provided in the module “Ploc
” to manage locations.
First, “Ploc.dummy
” is a dummy location used when the element
does not correspond to any source, or if the programmer does not want
to worry about locations.
The function “Ploc.make
” builds a location from three parameters:
- the line number, starting at 1
- the position of the first column of the line
- a couple of positions of the location: the first one belonging to the given line, the second one being able to belong to another line, further.
If the line number is not known, it is possible to use the function
“Ploc.make_unlined
” taking only the couple of positions of the
location. In this case, error messages may indicate the first line
and a big count of characters from this line (actually from the
beginning of the file). With a good text editor, it is possible, to
find the good location, anyway.
If the location is built with “Ploc.make_unlined
”, and if your
program displays a source location itself, it is possible to use the
function “Ploc.from_file
” which takes the file name and the
location as parameters and return, by reading that file, the line
number, and the character positions of the location.
Raising with a location
The function “Ploc.raise
” allows one to raise an exception
together with a location. All exceptions raised in the extensible
grammars use “Ploc.raise
”. The raised
exception is “Ploc.Exc
” with two parameters: the location and the
exception itself.
Notice that “Ploc.raise
” just reraises the exception if it is
already the exception “Ploc.Exc
”, ignoring then the new given
location.
A paradigm to print exceptions possibly enclosed by “Ploc.Exc
” is
to write the “try..with
” statement like this:
try ... with exn ->
let exn =
match exn with
[ Ploc.Exc loc exn -> do { ... print the location ...; exn }
| _ -> exn ]
in
match exn with
...print the exception which is *not* located...
Other functions
Some other functions are provided:
Ploc.first_pos
- returns the first position (an integer) of the location.
Ploc.last_pos
- returns the last position (an integer) of the location (position of the first character not belonging to the element.
Ploc.line_nb
- returns the line number of the location or
-1
if the location does not contain a line number (i.e. built by “Ploc.make_unlined
”). Ploc.bol_pos
- returns the position of the beginning of the line of the location.
It is zero if the location does not contain a line number (i.e.
built by “
Ploc.make_unlined
”).
And still other ones used in Camlp5 sources:
Ploc.encl
- “
Ploc.encl loc1 loc2
” returns the location starting at the smallest begin of “loc1
” and “loc2
” and ending at their greatest end.. In simple words, it is the location enclosing “loc1
” and “loc2
” and all what is between them. Ploc.shift
- “
Ploc.shift sh loc
” returns the location “loc
” shifted with “sh
” characters. The line number is not recomputed. Ploc.sub
- “
Ploc.sub loc sh len
” is the location “loc
” shifted with “sh
” characters and with length “len
”. The previous ending position of the location is lost. - “
Ploc.after
” - “
Ploc.after loc sh len
” is the location just after “loc
” (i.e. starting at the end position of “loc
”), shifted with “sh
” characters, and of length “len
”.