uuseg 14.0.0

Unicode text segmentation for OCaml

Uuseg is an OCaml library for segmenting Unicode text. It implements the locale-independent Unicode text segmentation algorithms to detect grapheme cluster, word, and sentence boundaries, and the Unicode line breaking algorithm to detect line break opportunities.

The library is independent of any IO mechanism or Unicode text data structure, and it can process text without a complete in-memory representation.
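The incremental style described above can be sketched as follows. This is a minimal illustration, not the library's own code: it assumes the core `Uuseg.create` and `Uuseg.add` interface, decodes UTF-8 with the OCaml standard library (4.14 or later) instead of Uutf, and the helper name `segments` is hypothetical. Characters are fed to the segmenter one at a time, so the same loop would work with characters arriving from a channel or a socket.

```ocaml
(* Sketch: split a UTF-8 string into segments by feeding a segmenter
   one character at a time. [segments] is a hypothetical helper name.
   Assumes the uuseg library and OCaml >= 4.14 (stdlib UTF-8 decoding). *)
let segments (boundary : Uuseg.boundary) (s : string) : string list =
  let seg = Uuseg.create boundary in
  let buf = Buffer.create 16 in
  (* Move the characters collected so far into the result list. *)
  let flush acc =
    let segment = Buffer.contents buf in
    Buffer.clear buf;
    if segment = "" then acc else segment :: acc
  in
  (* After each input, drain the segmenter until it awaits more input. *)
  let rec add acc v = match Uuseg.add seg v with
    | `Uchar u -> Buffer.add_utf_8_uchar buf u; add acc `Await
    | `Boundary -> add (flush acc) `Await
    | `Await | `End -> acc
  in
  let rec loop acc i =
    if i >= String.length s then add acc `End else
    let d = String.get_utf_8_uchar s i in
    (* Malformed bytes decode to U+FFFD, the replacement character. *)
    let u = Uchar.utf_decode_uchar d in
    loop (add acc (`Uchar u)) (i + Uchar.utf_decode_length d)
  in
  List.rev (flush (loop [] 0))
```

For example, `segments `Word "Hello world"` should yield the three segments `"Hello"`, `" "` and `"world"`, since the word boundary rules also delimit the space between the two words.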

Uuseg depends on Uucp and optionally on Uutf for support of UTF-X encoded OCaml strings. It is distributed under the ISC license.
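When the optional string support is available, segmenting an in-memory string is a single fold. The sketch below assumes the `Uuseg_string.fold_utf_8` function as provided by this release line (in 14.0.0 it is expected to live in a separate sub-library that requires Uutf); the helper names `utf_8_words` and `grapheme_count` are hypothetical.

```ocaml
(* Sketch: folding over the segments of a UTF-8 encoded string.
   Assumes Uuseg_string.fold_utf_8 from the optional string support. *)

(* Collect the word-level segments of [s] in order. *)
let utf_8_words (s : string) : string list =
  List.rev (Uuseg_string.fold_utf_8 `Word (fun acc w -> w :: acc) [] s)

(* Count user-perceived characters (grapheme clusters) in [s]. *)
let grapheme_count (s : string) : int =
  Uuseg_string.fold_utf_8 `Grapheme_cluster (fun n _ -> n + 1) 0 s
```

Counting grapheme clusters rather than bytes or scalar values is the usual reason to reach for this: a base letter followed by a combining accent counts as one unit.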

Homepage: http://erratique.ch/software/uuseg

Tags: unicode, text, segmentation, org:erratique
Author: The uuseg programmers
License: ISC
Homepage: https://erratique.ch/software/uuseg
Issue Tracker: https://github.com/dbuenzli/uuseg/issues
Maintainer: Daniel Bünzli <daniel.buenzli@erratique.ch>
Source: https://erratique.ch/software/uuseg/releases/uuseg-14.0.0.tbz
sha512=3f089baf95f010663a0c2f060b2911395d9b396f478efb10fd979815f527c9e61e0a70b3192f2e921f59287bfde0da6e25109d4a1825554e2e4a50c0535e97aa