Lingua::IdSplitter - split identifiers into words
version 0.03
use Lingua::IdSplitter;
my $splitter = Lingua::IdSplitter->new;
$splitter->split($identifier);
This module implements an algorithm to identify and split multi-word identifiers into their individual words. For example, "UserFind" is split into "user" and "find", and "timesort" into "time" and "sort".
For more details on the algorithm check the following article (also available here).
Create a new splitter object. Passing a list of specific dictionaries is
optional; see the bin/id-splitter command for an example of how to use
additional dictionaries.
Perform a soft split of the identifier, i.e., split it into words without relying on explicit markers (e.g., the underscore character or CamelCase notation).
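The idea of a soft split can be sketched as follows. This is only a minimal illustration, not the module's implementation: the toy dictionary, the helper names segmentations and soft_split_sketch, and the "fewest words wins" ranking are all assumptions made for the example. The module uses its own dictionaries and scoring.

```perl
use strict;
use warnings;

# Toy dictionary (an assumption for illustration only).
my %dict = map { $_ => 1 } qw(time sort user find me);

# Hypothetical helper: return every way to cut $id into dictionary words,
# each candidate as an array reference.
sub segmentations {
    my ($id) = @_;
    return ([]) if $id eq '';
    my @found;
    for my $len (1 .. length $id) {
        my $head = substr $id, 0, $len;
        next unless $dict{$head};
        push @found, map { [ $head, @$_ ] } segmentations(substr $id, $len);
    }
    return @found;
}

# Hypothetical helper: rank candidates by word count (a stand-in for real
# scoring) and return the best one, or the identifier itself if none exist.
sub soft_split_sketch {
    my ($id) = @_;
    my @ranked = sort { @$a <=> @$b } segmentations(lc $id);
    return @ranked ? @{ $ranked[0] } : ($id);
}

print join(' ', soft_split_sketch('timesort')), "\n";   # prints "time sort"
```

An identifier that cannot be fully covered by dictionary words is returned unchanged in this sketch.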
Perform a hard split of the identifier, i.e., split it into words using explicit markers (e.g., the underscore character or CamelCase notation).
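A hard split needs no dictionary, so it can be approximated with a regular expression. The sketch below is illustrative only: hard_split_sketch is a hypothetical helper, not part of the module's API, and lowercasing the result is an assumption based on the "UserFind" example above.

```perl
use strict;
use warnings;

# Hypothetical helper: split on underscores and other non-word characters,
# then on CamelCase boundaries and digit runs.
sub hard_split_sketch {
    my ($id) = @_;
    my @words;
    for my $chunk (grep { length } split /[_\W]+/, $id) {
        # Alternatives, in order: an uppercase run not followed by a
        # lowercase letter (acronym), an optionally capitalised word,
        # or a run of digits.
        push @words, $chunk =~ /[A-Z]+(?![a-z])|[A-Z]?[a-z]+|[0-9]+/g;
    }
    return map { lc } @words;
}

print join(' ', hard_split_sketch('UserFind')), "\n";    # prints "user find"
print join(' ', hard_split_sketch('user_find')), "\n";   # prints "user find"
print join(' ', hard_split_sketch('XMLParser')), "\n";   # prints "xml parser"
```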
Perform a full split: first apply a hard split, then apply a soft split to each of the parts produced by the hard split.
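The two-stage pipeline can be sketched like this. Again, this is an assumption-laden illustration rather than the module's code: hard_part, soft_part and split_sketch are hypothetical names, the dictionary is a toy, and the soft stage here simply prefers the longest dictionary prefix instead of computing ranked scores.

```perl
use strict;
use warnings;

# Toy dictionary (an assumption for illustration only).
my %dict = map { $_ => 1 } qw(time sort user find);

# Stage 1 (hard): cut on underscores and lower-to-upper case transitions.
sub hard_part {
    my ($id) = @_;
    return grep { length } split /_+|(?<=[a-z])(?=[A-Z])/, $id;
}

# Stage 2 (soft): longest dictionary prefix first, backtracking until the
# whole word is covered; returns an empty list when no full cover exists.
sub soft_part {
    my ($word) = @_;
    return () if $word eq '';
    for my $len (reverse 1 .. length $word) {
        my $head = substr $word, 0, $len;
        next unless $dict{$head};
        my @rest = soft_part(substr $word, $len);
        return ($head, @rest) if @rest || $len == length $word;
    }
    return ();
}

# Apply the soft stage to every part produced by the hard stage, keeping a
# part as-is (lowercased) when the soft stage cannot cover it.
sub split_sketch {
    my ($id) = @_;
    return map {
        my @soft = soft_part(lc $_);
        @soft ? @soft : (lc $_);
    } hard_part($id);
}

print join(' ', split_sketch('User_findTimesort')), "\n";   # prints "user find time sort"
```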
Show the computed ranking (including scores) of the candidate splits for an identifier.
Nuno Carvalho smash@cpan.org
This software is copyright (c) 2014-2015 by Project Natura natura@natura.di.uminho.pt.
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.