6. Conclusions
In this study, six peculiarities of claim syntax that increase the difficulty of parsing are highlighted: (1) claim template, (2) post attribute past participle, (3) parenthetical sentence, (4) complex noun phrase as sentence, (5) recursion, and (6) coordination. These peculiarities make claims long, and the last four in particular lead to long-distance dependencies. A new two-level parser is proposed for patent claim parsing. It is designed to improve the adaptability of a conventional parser, e.g. the Stanford parser. The conventional parser (the first level) is invoked by a higher-level parser (the second level) that handles the peculiarities of claim syntax. For peculiarity (1), a trimming process is adopted to filter out non-informative content. For peculiarity (2), a POS correction process is adopted to change the past form into the past participle. For the last four peculiarities, a new dependency syntax called Independent Claim Segment Dependency Syntax (ICSDS) is proposed. To guarantee the efficiency of the proposed parser, a segmentation strategy is adopted: the segmentation and the subsequent assembly are executed by the ICSDS-based parser, and the conventional parser is invoked only to process each claim segment. In theory, the distance between two segments (measured in segments) is much smaller than the distance (measured in words) between two words that lie in those two segments. Thus, the dependency becomes shorter and easier to capture.
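
The distance argument can be illustrated with a minimal Python sketch. It is not the ICSDS segmentation itself: the punctuation-based splitting rule, the example claim, and the function names (`split_into_segments`, `locate`, `distances`) are assumptions introduced only for illustration.

```python
import re


def split_into_segments(claim):
    """Split a claim on commas, semicolons and colons.

    This rule is an illustrative assumption, not the ICSDS segmentation.
    """
    parts = re.split(r"[,;:]", claim)
    return [part.split() for part in parts if part.strip()]


def locate(segments, word):
    """Return (segment index, global word index) of the first occurrence of word."""
    offset = 0
    for seg_idx, segment in enumerate(segments):
        if word in segment:
            return seg_idx, offset + segment.index(word)
        offset += len(segment)
    raise ValueError(f"{word!r} not found in claim")


def distances(claim, head, dependent):
    """Return (distance in words, distance in segments) between two words."""
    segments = split_into_segments(claim)
    head_seg, head_pos = locate(segments, head)
    dep_seg, dep_pos = locate(segments, dependent)
    return abs(head_pos - dep_pos), abs(head_seg - dep_seg)


# Hypothetical claim, written only for this example.
claim = (
    "A device comprising: a housing having a first end and a second end; "
    "a shaft mounted in the housing, the shaft being rotatable; "
    "and a motor coupled to the shaft"
)

word_dist, seg_dist = distances(claim, "comprising", "motor")
print(f"distance in words: {word_dist}, distance in segments: {seg_dist}")
```

Under these assumptions, the link between "comprising" and "motor" spans far fewer segments than words, which is the effect a segment-level parser is intended to exploit.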