A second major difference is the kind of information used by TBL. TBL first tags data using the most frequent category of the elements. The system can then use the category of adjacent elements (left and right), information very useful that ALLiS does not use. ALLiS could use the category of the preceding elements, since this information will be available during parsing, but we do not incorporate this information into the features3. Both systems do not use the same information, but they provide comparable results for this task (NP extraction) (Table 5). Besides, ALLiS only generates 1369 rules whereas TBL stops after 2000 rules. The quality of the rules seems then to be better. Table 5 also shows the improvement due to the use of the lexicalized information (W feature).
Another interesting point in TBL is the absence of threshold, since, at each iteration, the best rule is chosen.
No threshold equivalent to our
is needed. The stopping criterion is determined by the maximum number of rules the system learns (after n rules, it stops).
Increasing the number of rules generally improves results.
After this first experiment using no prior knowledge, we now present the linguistic knowledge we will used in order to improve ALLiS.