Ruby escape - HTML/URI/shell escaping utilities
Features:
- several escaping/composing functions
* HTML text
* HTML attribute value
* URI path
* shell command line
- dedicated classes for escaped strings
- escape and compose strongly related strings at once
WWW: http://www.a-k-r.org/escape/
Ruby objects with linguistic methods.
It includes an English-language module with inflection,
pluralisation, conjunctions, indefinite articles, present
participles, ordinal numbers, numbers to words, general
quantification, integration with WordNet and CMU's LinkGrammar,
as well as a framework for providing modules for other languages.
WWW: http://deveiate.org/projects/Linguistics/
PR: ports/155701
Submitted by: Eric Freeman <freebsdports at chillibear.com>
This package provides a translation string class, a translation string
factory class, translation and pluralization primitives, and a utility
that helps Chameleon templates use translation facilities of this package.
It does not depend on Babel, but its translation and pluralization
services are meant to work best when provided with an instance of
the babel.support.Translations class.
WWW: http://pypi.python.org/pypi/translationstring
for accents etc. into their Unicode equivalents. It translates
commands for special characters or accents into their Unicode
equivalents and removes formatting commands.
WWW: http://search.cpan.org/dist/LaTeX-ToUnicode/
create or transform office documents, or to extract data from them. It can
handle documents which comply with the Open Document Format international
standard (ODF). It may handle text documents (ODT), spreadsheet documents(ODS),
as well as presentation(ODP) or drawing documents (ODG).
WWW: http://search.cpan.org/dist/ODF-lpOD/
XML::LibXML::SAX::ChunkParser's purpose is to simply keep parsing
possibly incomplete XML fragments, for example, from a socket.
WWW: http://search.cpan.org/dist/XML-LibXML-SAX-ChunkParser/
PR: ports/155340
Submitted by: "ports@c0decafe.net" <ports@c0decafe.net>
Three main modules and some supporting program files are contained. For
more details read following POD documentations:
Text::LineFold - Line Folding for Plain Text
Unicode::GCString - String as Sequence of UAX #29 Grapheme Clusters
Unicode::LineBreak - UAX #14 Unicode Line Breaking Algorithm
WWW: http://search.cpan.org/dist/Unicode-LineBreak/
PR: ports/154845
Submitted by: Olivier Girard <Olivier.Girard at univ-angers.fr>
A generic swappable back-end for XML parsing.
Lots of Ruby libraries utilize XML parsing in some form, and
everyone has their favorite XML library. In order to best
support multiple XML parsers and libraries, multi_xml is a
general-purpose swappable XML backend library.
WWW: http://rubygems.org/gems/multi_xml
- Clean up bsd.python.mk (remove PYWSGIREF, PYHASHLIB, PYCTYPES and PYEXPAT) all these is now part of python
since python25
Remove:
textproc/py-expat
devel/py-ctypes
security/py-hashlib
www/py-wsgiref
XML/Ada is a set of modules that provide a simple manipulation of XML
streams. It supports the whole XML 1.1 specification and can parse any file
that follows this standard, including the contents of the DTD although no
validation of the documents is performed based on those.
It provides support for a number of standards associated with XML such as
SAX, DOM, and XML schemas. Additionally, it includes a module to manipulate
unicode streams since this is required by the XML standard.
WWW: http://libre.adacore.com/libre/tools/xmlada/
PR: ports/153828
Submitted by: John Marino <freebsdml@marino.st>
This is a port of the 3.x series of the Apache Xerces-C++ library.
Xerces-C++ is a validating XML parser written in a portable subset of C++.
Xerces-C++ makes it easy to give your application the ability to read and
write XML data. A shared library is provided for parsing, generating,
manipulating, and validating XML documents using the DOM, SAX, and SAX2 APIs.
http://loc.gov/marc/. This package enables you to read existing MARC records
from a file, string, or (using the YAZ extension), from a Z39.50 source. You
can also use this package to create new MARC records.
WWW: http://pear.php.net/package/File_MARC
to be a parser or query verifier. It just creates sane tokens from a
valid SQL query.
WWW: http://search.cpan.org/dist/SQL-Tokenizer
PR: ports/154706
Submitted by: milki <milki AT rescomp.berkeley.edu>
You can use it to produce PDF or PS files from a HTML-document.
Currently many but not all parameters of HTMLDoc are supported.
WWW: http://search.cpan.org/dist/HTML-HTMLDoc/
PR: ports/153955
Submitted by: Menshikov Konstantin <kostjnspb@yandex.ru>
Feature safe: yes
- Update textproc/libwpd to 0.9.0 release
- Add graphics/libwpg01 and textproc/libwpd08 as support libraries only
- Take maintainership of textproc/libwpd[08] [1]
- Chase dependencies in related ports
- Describe updating procedure in UPDATING
Approved by: Mikhail Teterin (maintainer) via mail [1]
Features:
* search for regular expressions.
* support for some important grep options, including:
o filename output.
o page number output.
o optional case insensitivity.
o count occurrences.
* and the most important feature: color output!
WWW: http://pdfgrep.sourceforge.net/
PR: ports/152648
Submitted by: Michael Ranner <michael@ranner.eu>
Internationalized Domain Names (IDNs).
IDNs use characters drawn from a large repertoire (Unicode), but IDNA
allows the non-ASCII characters to be represented using only the ASCII
characters already allowed in so-called host names today (letter-digit-
hypen, "/[A-Z0-9-]/i").
tools may provide very nice user interfaces, but tend to produce
horrible XML. If you have to read these ugly products, you are in for
pain. The purpose of this module (and the script "xmlrewrite" which is
part of this distribution) is to be able to rewrite XML messages and
Schema's into something maintainable.
The main difference between this module and other beautifiers is that
the clean-up is based on schema rules. For instance, it is permitted
to remove blanks around and inside integers, but not in strings.
Beautifiers which do not look into the schema have only limited
possibilities for cleanup, or may accidentally change the message
content.
WWW: http://search.cpan.org/dist/XML-Rewrite/
PR: ports/152658
Submitted by: Patrick Powell <papowell@astart.com>
editing a file in-place. Inspired by variations of Perl's -i option,
this module is intended for somewhat more structured and reusable
editing than command line Perl typically allows.
File::Inplace endeavors to guarantee file integrity; that is, either
all of the changes made will be saved to the file, or none will.
It also offers functionality such as backup creation, automatic
field splitting per-line, automatic chomping/unchomping, and aborting
edits partially through without affecting the original file.
WWW: http://search.cpan.org/dist/File-Inplace/
- Add textproc/uim-kde4, KDE4 panel applet of uim input method. It is a
slave port of textproc/uim.
- Remove japanese/uim-canna and japanese/uim-prime. Now the canna and prime
modules are implemented as scheme scripts and they can be installed from
textproc/uim.
- Enable helper tools installation of textproc/uim-qt4.
- Enable sj3 support of textproc/uim.
full-featured text search engine written in Java.
CLucene is faster than lucene as it is written in C++.
WWW: http://sourceforge.net/projects/clucene/
PR: ports/151951
Submitted by: Tom Judge <tom at tomjudge.com>
to their Unicode characters. A function for decoding is provided.
The mapping can be generated from a DTD file with entity definitions.
WWW: http://search.cpan.org/dist/XML-Entities/
CSS::Simple is an interface through which to read/write/manipulate CSS
files while respecting the cascade order.
WWW: http://search.cpan.org/dist/CSS-Simple
PR: ports/151798
Submitted by: Vivek Khera <vivek@khera.org>
Approved by: pgollucci (mentor, implicit)
RT is a simple and human-readable table format.
RTtool is a converter from RT into various formats.
RT can be incorporated into RD.
At this time, RTtool can convert RT into HTML and plain text.
To convert into plain text, you need w3m.
WWW: http://www.rubyist.net/~rubikitch/computer/rttool/index.en.html
It inherits Text::CSV and is aware of input/output encodings.
WWW: http://search.cpan.org/dist/Text-CSV-Encoded/
PR: ports/150710
Submitted by: Sebastien Santoro <dereckson@gmail.com>
in ePub format.
* Free and open source software under GPLv3
* Multi-platform: runs on Windows, FreeBSD, Linux and Mac
* Full Unicode support: everything you see in Sigil is in UTF-16
* Full EPUB spec support
* WYSIWYG editing
* Multiple Views: Book View, Code View and Split View
* Metadata editor with full support for all possible metadata entries
(more than 200) with full descriptions for each
* Table Of Contents editor
* Multi-level TOC support
* Book View fully supports the display of any XHTML document possible
under the OPS spec
* SVG support
* Basic XPGT support
* Advanced automatic conversion of all imported documents to Unicode
* Currently imports TXT, HTML and EPUB files; more will be added with time
* Embedded HTML Tidy; all imported documents are thoroughly cleaned;
changing views cleans the document so no matter how much you screw
up your code, it will fix it (usually)
* An actually usable user interface
WWW: http://code.google.com/p/sigil/
PR: ports/150348
Submitted by: Jonathan Chen <jonc@chen.org.nz>
all macro processors. It is still maintained and ported to a large number
of systems.
WWW: http://www.ml1.org.uk
PR: ports/150234
Submitted by: Bob Eager <rde at ml1.org.uk>