uusegversion

Unicode text segmentation for OCaml

Uuseg is an OCaml library for segmenting Unicode text. It implements the locale independent Unicode text segmentation algorithms to detect grapheme cluster, word and sentence boundaries and the Unicode line breaking algorithm to detect line break opportunities.

The library is independent from any IO mechanism or Unicode text data structure and it can process text without a complete in-memory representation.

Uuseg depends on Uucp and optionally on Uutf for support on OCaml UTF-X encoded strings. It is distributed under the ISC license.

Tags segmentation text unicode org:erratique
AuthorDaniel Bünzli <daniel.buenzl i@erratique.ch>
LicenseISC
Published
Homepagehttp://erratique.ch/software/uuseg
Issue Trackerhttps://github.com/dbuenzli/uuseg/issues
MaintainerDaniel Bünzli <daniel.buenzl i@erratique.ch>
Dependencies
Optional dependencies
Conflicts
Source [http] http://erratique.ch/software/uuseg/releases/uuseg-1.0.1.tbz
sha256=3b4ba84e70b972e013b1a0265523bce9432883f9ab7ab82a74b24e6fb70c8bef
md5=bd7b27ebe493d5bcec08c23395b5ae7d
Edithttps://github.com/ocaml/opam-repository/tree/master/packages/uuseg/uuseg.1.0.1/opam
Required by