Commit graph

1506 commits

Author SHA1 Message Date
Dmitry Marakasov
fc08b00089 This is YALI, its a LOLCODE interpreter, written in perl.
WWW: http://1.618034.com/yali-yet-another-lolcode-interpreter/index.php

- While here, keep textproc/Makefile sorted

PR:		141173
Submitted by:	Tom Sparks <tsparks@appliedops.net>
2010-01-14 14:58:03 +00:00
Gabor Pali
5947f4845e The texmathml Haskell library provides functions to convert LaTeX math
formulas to presentation MathML. It supports basic LaTeX and AMS
extensions, but not macros.

WWW: http://hackage.haskell.org/package/texmath

PR:		ports/142580
Submitted by:	Jacula Modyun <jacula(at)gmail.com>
2010-01-12 22:22:11 +00:00
Gabor Pali
bf227d9075 Interfacing with RSS (v 0.9x, 2.x, 1.0) + Atom feeds in Haskell.
WWW: http://hackage.haskell.org/package/feed

PR:		ports/142568
Submitted by:	Jacula Modyun <jacula(at)gmail.com>
2010-01-12 22:17:55 +00:00
Gabor Pali
710cd8e50b A simple XML library for Haskell.
WWW: http://hackage.haskell.org/package/xml

PR:		ports/142567
Submitted by:	Jacula Modyun <jacula(at)gmail.com>
2010-01-12 22:16:04 +00:00
Gabor Pali
f879f12e25 Basic implementation of the standard diff algorithm in Haskell.
WWW: http://hackage.haskell.org/package/Diff

PR:		ports/142564
Submitted by:	Jacula Modyun <jacula(at)gmail.com>
2010-01-12 22:11:25 +00:00
Philip M. Gollucci
f8a3defe5d glpi-plugins-DataInjection is a glpi plugin.
It allows data import into GLPI using CSV files.

It allows to create models of injection for a future re-use.
It's been created in order to :
	* Import datas coming from others asset management softwares
	* Inject electronic delivery forms
Datas to be imported using the plugains are :
	* Inventory datas (except softwares and licenses)
	* Management datas (contract, contact, supplier)
	* Configuration datas (user, group, entity)

PR:		ports/140177
Submitted by:	ddurieux <d.durieux at siprossii.com>
2010-01-09 23:20:47 +00:00
Philip M. Gollucci
5259fee7c3 glpi-plugins-AdditionalReports is a glpi plugin.
It enables additional reports.

Main features :

	* It also plugin allow you to add new reports in a simply way
	  (one PHP script for the report and one for the translation).
	* It handle the right for each new report
	* It provides some new reports (as sample)

PR:		ports/140176
Submitted by:	ddurieux <d.durieux at siprossii.com>
2010-01-09 23:20:23 +00:00
Martin Wilke
3b7cfc1c84 Quick CSS parser
WWW: http://search.cpan.org/dist/CSS-Croco

PR:		ports/142050
Submitted by:	andrey at kostenko.name
2010-01-06 22:02:33 +00:00
Gabor Pali
dd2f74778b TagSoup is a Haskell library for extracting information out of
unstructured HTML code, sometimes known as tag-soup. The HTML does not
have to be well formed, or render properly within any particular
framework.  This library is for situations where the author of the HTML
is not cooperating with the person trying to extract the information, but
is also not trying to hide the information.

WWW: http://www-users.cs.york.ac.uk/~ndm/tagsoup/

PR:		ports/142183
Submitted by:	Jacula Modyun <jacula(at)gmail.com>
2010-01-02 23:18:07 +00:00
Sergey Skvortsov
6b89683496 Add pootle 2.0.0, pootle is a user-friendly web portal for simple
translation process.
2009-12-31 11:53:33 +00:00
Martin Wilke
8d31ff2abd Text::Emoticon::MSN is a text filter that replaces text emoticons
like ":-)", ";-P", etc. to the icons of MSN Messenger, detailed
in http://messenger.msn.com/Resource/Emoticons.aspx

WWW: http://search.cpan.org/dist/Text-Emoticon-MSN/

PR:		ports/142027
Submitted by:	Alex Deiter <Alex.Deiter at Gmail.COM>
2009-12-26 18:00:25 +00:00
Martin Wilke
8dcc339761 Ruby library for rendering safe templates which cannot affect the
security of the server they are rendered on.

WWW: http://www.liquidmarkup.org/

PR:		ports/141946
Submitted by:	peter.schuller at infidyne.com
2009-12-24 22:14:56 +00:00
Jun Kuriyama
b69b437cb0 Patches text with given patch.
WWW: http://search.cpan.org/dist/Text-Patch/
2009-12-20 14:14:57 +00:00
Wen Heping
bccded332e This module is a companion for XML::LibXML. It operates with LibXML
objects, could return or accept LibXML objects, and may be used for
easy data transformations.

It is faster in parsing then XML::Simple, XML::Hash, XML::Twig and
of course much slower than XML::Bare.

It is faster in composing than XML::Hash, but slower than XML::Simple.

WWW:    http://http://search.cpan.org/dist/XML-Hash-LX/
2009-12-09 06:55:01 +00:00
Lars Thegler
3a3b9b6e55 Add p5-String-Truncate 0.102, a module for when strings are too long to
be displayed in....
2009-12-06 14:39:30 +00:00
Steven Kreuzer
404637ffaf Parse::BooleanLogic is a fast parser for boolean expressions. Originally
written for Request Tracker to parse SQL like expressions, it can be
used to parse other boolean logic sentences with OPERANDs joined using
binary OPERATORs and grouped and nested using parentheses.
2009-11-24 14:51:07 +00:00
Martin Wilke
f411386c94 This is the table based input method framework for IBus (Intelligent Input
Bus). It does not include real tables except two demo layouts, "Compose"
(which mimics Compose Key input) and "LaTeX" (which allows you to use LaTeX
commands as keyboard sequences to input various symbols).

WWW: http://code.google.com/p/ibus

PR:		ports/140761
Submitted by:	Nikola Lecic <nikola.lecic at anthesphoria.net>
2009-11-22 13:21:29 +00:00
Wen Heping
17a564386d Intelligent Input Bus for Linux / Unix OS, iBus, is the next generation
input framework, developed by the developer of scim-python, includes all
its function, and much more.

This is the ibus-qt port, the input method module for Qt4 of ibus.
You may select ibus as the input method in qtconfig after installation.

WWW: http://code.google.com/p/ibus

PR:		ports/140665
Submitted by:	Henry Hu <henry.hu.sh@gmail.com>
2009-11-22 12:14:10 +00:00
Boris Samorodov
8410624681 The project aims to create a feature-rich dictionary lookup program.
It supports:
    * Babylon .BGL files, complete with images and resources;
    * StarDict .ifo/.dict./.idx/.syn dictionaries;
    * Dictd .index/.dict(.dz) dictionary files;
    * ABBYY Lingvo .dsl source files, together with abbreviations.
      The files can be optionally compressed with dictzip. Dictionary
      resources can be packed together into a .zip file;
    * ABBYY Lingvo .lsa/.dat audio archives. Those can be indexed
      separately, or be referred to from .dsl files.

LICENSE: GPL3 or later

WWW: http://goldendict.berlios.de/
2009-11-13 14:07:49 +00:00
Dmitry Sivachenko
4b2e0ffef2 New port: XML-Validate.
XML::Validate is a generic interface to different XML validation backends.
2009-11-10 11:16:35 +00:00
Wen Heping
10e3edfe53 Pod::Autopod is designed to generate pod documentation of a perl
class by analysing its code. The idea is to have something similar
like javadoc. So it uses also comments written directly obove the
method definitions. It is designed to asumes a pm file which
represents a class.

Of course it can not understand every kind of syntax, parameters,
etc. But the plan is to improve this library in the future to
understand more and more automatically.

WWW: http://search.cpan.org/dist/Pod-Autopod/

Approved by:	miwi,jadawin(mentors,implicit)
2009-10-31 12:49:47 +00:00
Sergey Skvortsov
ad255fb0d4 textproc/p5-XML-LibXML-Common merged into textproc/p5-XML-LibXML. 2009-10-30 16:18:57 +00:00
Wen Heping
793a62517d Markdown is a text-to-HTML filter; it translates an easy-to-read and
easy-to-write structured text format into HTML. Markdown's text format
is most similar to that of plain text email, and supports features such
as headers, *emphasis*, code blocks, blockquotes, and links.

Markdown's syntax is designed not as a generic markup language, but
specifically to serve as a front-end to (X)HTML. You can use span-level
HTML tags anywhere in a Markdown document, and you can use block level
HTML tags (like <div> and <table> as well).

Text::MultiMarkdown implements the MultiMarkdown markdown syntax
extensions from: http://fletcherpenney.net/multimarkdown/

WWW: http://search.cpan.org/dist/Text-MultiMarkdown/

Approved by:	mentors(implicit)
2009-10-23 02:59:15 +00:00
Martin Wilke
eb411f3c8b OpenOffice::OODoc is an extensible Perl interface allowing direct
read/write operations on files which comply with the
OASIS Open Document Format for Office Applications (ODF),
i.e. the ISO/IEC 26300:2006 standard.

It provides a high-level, document-oriented language, and isolates
the programmer from the details of the file format. It can process
different document classes (texts, spreadsheets, presentations,
and drawings). It can retrieve or update styles and images,
document metadata, as well as text content.

WWW: http://search.cpan.org/dist/OpenOffice-OODoc/

PR:		ports/139548
Submitted by:	meyser at xenet.de
2009-10-21 09:16:05 +00:00
Martin Wilke
3686afa8be This is a keyboard for input of the Malayalam according to the transliteration
scheme called Mozhi (https://sites.google.com/site/cibu/mozhi). The keymap is
written in Keyman keyboard language and developed as a part of Varamozhi
Project under the LGPL license.

The Mozhi is intended to be the most intuitive scheme for Malayalam speakers.
It simplifies what the user needs to remember and is is not phonetically
accurate.

This keymap supports the current standard for Malayalam Chillus (i.e. without
special encoding). It offers mnemonic keyboard functionality and smart-quote
functionality with comas and numerals.

This port installs the keyboard so that it can be used through SCIM KMFL
IMEngine (textproc/scim-kmfl-imengine).

WWW: http://varamozhi.sourceforge.net/

PR:		ports/139498
Submitted by:	Nikola Lecic <nikola.lecic at anthesphoria.net>
2009-10-21 09:15:39 +00:00
Wen Heping
873c5c197a XML::Reader provides a simple and easy to use interface for
sequentially parsing XML files (so called "pull-mode" parsing)
and at the same time keeps track of the complete XML-path.

It was developped as a wrapper on top of XML::Parser.

WWW:    http://search.cpan.org/dist/XML-Reader/

Approved by:	mentors(implicit)
2009-10-20 23:51:38 +00:00
Martin Wilke
7c86755dbb This implements a spell checker to Text::SpellChecker
using ZConf::GUI.

textspellchecker - Spell checker script.

WWW: http://search.cpan.org/dist/Text-SpellChecker-GUI

PR:		ports/139176
Submitted by:	Zane C, Bowers <vvelox at vvelox.net>
2009-09-30 14:21:44 +00:00
Martin Wilke
9417781f42 This is the m17n IMEngine for IBus (Intelligent Input Bus) framework. It
allows you to use official and contributed keyboard layouts of the m17n
project (available via devel/m17n-db and textproc/m17n-contrib) through
standard IBus interface. m17n-lib currenty supports input of more than 60
languages with more than 70 language-specific input methods.

WWW: http://code.google.com/p/ibus

PR:		ports/138521
Submitted by:	Nikola Lecic <nikola.lecic at anthesphoria.net>
2009-09-04 20:39:11 +00:00
Martin Wilke
fcb480eff1 This is a keyboard for typesetting Ancient Greek with precomposed Unicode
characters. It is written in Keyman Keyboard Language by SIL Non-Roman Script
Initiative (NRSI).

The main purpose of the keyboards is to provide a wide range of keying options,
so many characters can be entered in multiple ways. The features include:

  * preserving the context when deleting;
  * choosing the correct code for the sigma depending upon the encoding and
    the context (so the correct final form is used when appropriate);
  * understanding the context of gamma so that it can be typed as 'n' before
    kappa, xi or chi and as 'ng' before another gamma.
  * support for Greek punctuation.

WWW: http://scripts.sil.org/KeymanKeyboardLinks#e9f80714

PR:		ports/138447
Submitted by:	Nikola Lecic <nikola.lecic at anthesphoria.net>
2009-09-02 17:26:54 +00:00
Martin Wilke
df43c95b1e This is a keyboard for input of the standardized Yi script of southwestern
China with Unicode Yi fonts. It is written in Keyman keyboard language and
developed by SIL Non-Roman Script Initiative (NRSI).

To keyboard a Yi syllable, you should type the Pinyin romanization for that
syllable, followed by a space. For keyboarding punctuation, use the usual
punctuation keystrokes.

The keyboard is compatible with Yi range as defined in Unicode 3.0 and it does
not provide keystrokes for the Yi Radicals which were added to Unicode 3.2
(U+A4A2..U+A4A3, U+A4B4, U+A4C1, U+A4C5).

WWW: http://scripts.sil.org/SILYI_home

PR:		ports/138448
Submitted by:	Nikola Lecic <nikola.lecic at anthesphoria.net>
2009-09-02 17:25:47 +00:00
Martin Wilke
56267d067e This is a set of two keyboards that provides a single implementation for many
Roman writing systems across Africa, based on results compiled from data from
Benin, Burkina Faso, Cameroon, Chad, Cote d'Ivoire, Nigeria, Senegal and Togo.

The keyboards are written in Keyman keyboard language and developed by SIL
Non-Roman Script Initiative (NRSI). The software is distributed under the
X11-style license (http://scripts.sil.org/X11License).

This port installs the keyboards so that they can be used through SCIM KMFL
IMEngine (textproc/scim-kmfl-imengine). Two layouts are provided:

  * mnemonic layout for any keyboard (using deadkeys);
  * positional layout for US keyboard (using deadkeys and/or shift-states, i.e.
    RALT and LALT keys).
WWW: http://scripts.sil.org/AfricanKeyboard1

PR:		ports/138464
Submitted by:	Nikola Lecic <nikola.lecic at anthesphoria.net>
2009-09-02 17:24:41 +00:00
Martin Wilke
d7343fba11 TECkit (Text Encoding Conversion toolkit) is a toolkit for converting data
between 8-bit legacy encodings and Unicode. It can also be used for
transliteration of Unicode between different scripts.

TECkit uses a mapping description language (mapping byte encodings to Unicode).
Mapping rules can be extended by (1) the use of character sequences rather than
single characters on either side; (2) by the addition of contextual constraints
(environments) determining when a rule should apply; (3) and by the use of
character classes, optional and repeatable elements, grouping and alternation
to express more complex patterns to be matched and processed.

TECkit is particularly useful with XeTeX (Unicode-aware derivate of TeX).

The following binaries are provided:

  teckit_compile  mapping compiler that allows binary mapping tables (.tec)
                  to be built from TECkit description files (.map)
  sfconv          a tool for converting Standard Format (SF) files
  txtconv         a utility to apply TECkit mappings to plain-text files

WWW: http://scripts.sil.org/TECkit

PR:		ports/138212
Submitted by:	Nikola Lecic <nikola.lecic at anthesphoria.net>
2009-08-28 19:15:02 +00:00
Martin Wilke
04bf92d9f5 Template Toolkit plugin interface to Text::Autoformat module. This is
a separate distribution as of Template-Toolkit 2.22.

WWW: http://search.cpan.org/dist/Template-Plugin-Autoformat

PR:		ports/137941
Submitted by:	Vivek Khera <vivek at khera.org>
2009-08-27 18:46:51 +00:00
Martin Wilke
3fcafc814f Sphinx search engine API Perl client.
WWW: http://search.cpan.org/dist/Sphinx-Search

PR:		ports/138000
Submitted by:	Vivek Khera <vivek at khera.org>
2009-08-22 22:23:22 +00:00
Martin Wilke
0fbfd24033 Library to create spreadsheet files compatible with
MS Excel 97/2000/XP/2003 XLS files, on any platform, with Python 2.3
to 2.6

xlwt is a library for generating spreadsheet files that are compatible
with Excel 97/2000/XP/2003, OpenOffice.org Calc, and Gnumeric. xlwt
has full support for Unicode. Excel spreadsheets can be generated on
any platform without needing Excel or a COM server. The only
requirement is Python 2.3 to 2.6. xlwt is a fork of pyExcelerator.

WWW: http://www.python-excel.org/

PR:		ports/137969
Submitted by:	Dikshie
2009-08-22 22:10:52 +00:00
Martin Wilke
79b10a91b4 Library for developers to extract data from Microsoft Excel (tm)
spreadsheet files

Extract data from new and old Excel spreadsheets on any platform.
Pure Python (2.1 to 2.6). Strong support for Excel dates. Unicode-aware.

WWW: http://www.python-excel.org/

PR:		ports/137970
Submitted by:	Dikshie
2009-08-22 22:09:37 +00:00
Martin Wilke
ebd38f8e8d A logging framework handler that tracks when messages above a certain
level have been logged.

This is a handler for the python standard logging framework that can
be used to tell whether messages have been logged at or above a certain
level.

This can be useful when wanting to ensure that no errors have been
logged before committing data back to a database.

WWW: http://pypi.python.org/pypi/errorhandler/1.0.0

PR:		ports/137970
Submitted by:	Dikshie
2009-08-22 22:08:15 +00:00
Martin Wilke
2d3ff17aad Text::ASCIIMathML is a parser for ASCIIMathML text which produces
MathML XML markup strings that are suitable for rendering by any
MathML-compliant browser.

WWW: http://search.cpan.org/dist/Text-ASCIIMathML/

PR:		ports/137605
Submitted by:	Wen Heping <wenheping at gmail.com>
2009-08-13 16:41:48 +00:00
Steven Kreuzer
5e7eacd121 Perl extension for inclusive and exclusive canonicalization of XML using libxml2
PR:		ports/137474
Submitted by:	Gea-Suan Lin <gslin@gslin.org>
2009-08-07 16:01:56 +00:00
Dmitry Sivachenko
732e202086 New port: MARC::XML.
MARC-XML is an extension to the MARC-Record distribution for working with
XML data encoded using the MARC21slim XML schema from the Library of Congress.

For more details see: http://www.loc.gov/standards/marcxml/
2009-08-06 11:11:09 +00:00
Martin Wilke
80da2adf57 MARC::Charset allows you to turn MARC-8 encoded strings into UTF-8
strings. MARC-8 is a single byte character encoding that predates
unicode, and allows you to put non-Roman scripts in MARC bibliographic
records.

WWW:	http://search.cpan.org/dist/MARC-Charset

PR:		ports/137433
Submitted by:	Wen Heping <wenheping at gmail.com>
2009-08-05 09:10:54 +00:00
Dmitry Marakasov
163c3fea0b This module creates a layer on top of DOM that allows you to program
in a "push" style rather than "pull". Once the document has been parsed
and you have a DOM object, you can call on the DOMHandler's traverse()
method to apply a set of call-back routines to all the nodes in a tree.

WWW: http://search.cpan.org/dist/XML-DOMHandler-1.0/

PR:		137424
Submitted by:	Stefan Pauly <stefan@fh-mainz.de>
2009-08-05 00:39:35 +00:00
Martin Wilke
ea4f66d196 Python lib for:
* creole markup -> html
    * html -> creole markup

WWW:	http://code.google.com/p/python-creole/

PR:		ports/137235
Submitted by:	Stanislav Svirid <count at 211.ru>
2009-07-29 14:13:43 +00:00
Martin Wilke
76c5fd67c5 Apertium is an open-source machine translation platform, initially aimed
at related-language pairs but recently expanded to deal with more
divergent language pairs (such as English-Catalan). The platform
provides:

1. a language-independent machine translation engine
2. tools to manage the linguistic data necessary to build a machine
   translation system for a given language pair and
3. linguistic data for a growing number of language pairs

WWW: http://www.apertium.org/

PR:		ports/137135
Submitted by:	Mykola Dzham <freebsd at levsha.org.ua>
2009-07-28 17:47:44 +00:00
Martin Wilke
c7e0c5c34c lttoolbox is a toolbox for lexical processing, morphological analysis
and generation of words. The analysis is the process of splitting of
words splitting a word (e.g. cats) into its lemma 'cat' and the
grammatical information <n><pl>.  The generation is the opposite
process.
The package is split into three programs, lt-comp, the compiler,
lt-proc, the processor, and lt-expand, which generates all possible
mappings between surface forms and lexical forms in the dictionary.

WWW: http://wiki.apertium.org/wiki/Lttoolbox

PR:		ports/137134
Submitted by:	Mykola Dzham <freebsd at levsha.org.ua>
2009-07-28 17:47:02 +00:00
Anton Berezin
b97bef31e4 Add textproc/p5-XML-Parser-Style-EasyTree 0.03, a Perl module that
parses XML to a simple tree.
2009-07-28 13:36:18 +00:00
Sergey Skvortsov
3925af777f Add p5-String-RewritePrefix 0.004, rewrite strings based on a set of
known prefixes.

PR:		ports/137133
Submitted by:	Bill Brinzer <bill.brinzer xx gmail.com>
2009-07-28 09:18:40 +00:00
Renato Botelho
b16dc471d7 This is a keyboard for input of the complex Biblical Hebrew (including
cantillation marks) with Unicode fonts. It is written in Keyman keyboard
language and developed by SIL Non-Roman Script Initiative (NRSI).

This port installs the keyboard so that it can be used through SCIM KMFL
IMEngine (textproc/scim-kmfl-imengine).

The keyboard is provided under the terms of MIT/X11 License.

WWW: http://scripts.sil.org/SILHebrUnic2
     http://scripts.sil.org/SILHebrUni_Documentation

PR:		ports/136768
Submitted by:	Nikola Lecic <nikola.lecic at anthesphoria.net>
2009-07-16 17:33:06 +00:00
Renato Botelho
8593ce8c5c This keyboard is designed to enable simple input of most Latin script
languages, including all major European Latin-script languages. The
keyboard is written in KMN Keyboard Language by the KMN language
developer, Tavultesoft (http://www.tavultesoft.com). The keyboard
uses punctuation and letter keys in sequence to access diacritic and
other letters.

This port installs the keyboard so that it can be used through SCIM
KMFL IMEngine (textproc/scim-kmfl-imengine).

Some of the supported languages include: Afrikaans, Albanian,
Balearic, Basque, Breton, Catalan, Croatian, Czech, Danish, Dutch,
Esperanto, Estonian, Faroese, Finnish, French, Gaelic, Galician,
German, Hungarian, Icelandic, Inuktitut, Italian, Kashubian, Ladin,
Latvian, Lithuanian, Maltese, Norwegian, Nynorsk, Polish, Portugese,
Romansch, Saami, Serbian, Slovak, Slovenian, Sorbian, Spanish,
Swedish, Tagalog, Turkish, Valencian, Vlaams, Walloon, Welsh and Zulu.

The keyboard is distributed under the terms of 3-clause BSD-licence.

WWW: http://eurolatin.keymankeyboards.com/

PR:		ports/136150
Submitted by:	Nikola Lecic <nikola.lecic at anthesphoria.net>
2009-07-16 17:32:11 +00:00
Dmitry Marakasov
02cd8b6e64 Base class and utility methods for manipulating Perl via PPI. Pulled
out from the Padre::Task::PPI code.

WWW: http://search.cpan.org/dist/PPIx-EditorTools/

PR:		136699
Submitted by:	Cezary Morga <cm@therek.net>
2009-07-15 13:47:55 +00:00
Philip M. Gollucci
9eacfa6b4b This is an interface into HTML::SuperForm through the Template Toolkit. When
created without arguments (i.e. [% USE form = HTML.SuperForm %]), the
Template's stash is searched for an Apache object or a CGI object to pass to
HTML::SuperForm's constructor.

When created with arguments (i.e. [% USE form = HTML.SuperForm(arg) %]),
the arguments are passed to HTML::SuperForm's constructor.

A dep for www/p5-Gantry

Reported by:    Cezary Morga <cm@therek.net>
2009-07-14 23:18:18 +00:00
Philip M. Gollucci
94b541488f Used in its basic form, this module provides an interface for generating basic
HTML form elements much like HTML::StickyForms does. The main difference is
HTML::SuperForm returns HTML::SuperForm::Field objects rather than plain HTML.
This allows for more flexibilty when generating forms for a complex application.

To get the most out of this module, use it as a base (Super) class for your own
form object which generates your own custom fields. If you don't use it this way,
I guess there's really nothing Super about it. Example are shown later in the document.

The interface was designed with mod_perl and the Template Toolkit in mind,
but it works equally well in any cgi environment.

A dep for www/p5-Gantry

Reported by:    Cezary Morga <cm@therek.net>
2009-07-14 23:17:35 +00:00
Philip M. Gollucci
e271fb5d9f LuaExpat is a SAX XML parser based on the Expat library. SAX is the Simple API
for XML and allows programs to:

* process a XML document incrementally, thus being able to handle huge
  documents without memory penalties;
* register handler functions which are called by the parser during the
  processing of the document, handling the document elements or text.

With an event-based API like SAX the XML document can be fed to the parser in
chunks, and the parsing begins as soon as the parser receives the first
document chunk. LuaExpat reports parsing events (such as the start and end of
elements) directly to the application through callbacks. The parsing of huge
documents can benefit from this piecemeal operation.

WWW: http://www.keplerproject.org/luaexpat

PR:		ports/136265
Submitted by:	Andrew Lewis <dru at silenceisdefeat.net>
2009-07-14 06:02:55 +00:00
Martin Wilke
ad6e634e88 A simple wrapper for parsing OPML files.
WWW:	http://rubyforge.org/projects/opml/

PR:		ports/136719
Submitted by:	TERAMOTO Masahiro <markun at onohara.to>
2009-07-13 22:15:30 +00:00
Martin Wilke
8f1a58689c HTMLEntities is a simple library to facilitate encoding and decoding of
named (&yacute; and so on) or numerical (&#123; or &#x12a;) entities
in HTML and XHTML documents.

WWW:	http://rubyforge.org/projects/htmlentities/

PR:		ports/136713
Submitted by:	TERAMOTO Masahiro <markun at onohara.to>
2009-07-13 22:13:24 +00:00
Martin Wilke
fcfc412a69 LaTeX::Pod converts LaTeX sources to Perl's POD (Plain old
documentation) format. Currently only a subset of the available
LaTeX language is supported.

WWW:	http://search.cpan.org/dist/LaTeX-Pod/

PR:		ports/136639
Submitted by:	Wen Heping <wenheping at gmail.com>
2009-07-12 22:19:58 +00:00
Dennis Herrmann
83d8dfd1c1 Termcolor is a library for ANSI color formatting
like HTML for output in terminal.

WWW:	http://github.com/jugyo/termcolor/tree/master
2009-07-12 14:05:06 +00:00
Dmitry Marakasov
c1eba0ce96 Little and efficient Perl module to manage xml data.
WWW:	http://search.cpan.org/dist/XML-TinyXML/

PR:		136395
Submitted by:	Rick van der Zwet <info@rickvanderzwet.nl>
2009-07-07 13:27:38 +00:00
Beat Gaetzi
633cd7fd8a POD::Abstract provides a means to load a POD (or POD compatible)
document without direct reference to it's syntax, and perform
manipulations on the abstract syntax tree.

This can be used to support additional features for POD, to format
output, to compile into alternative formats, etc.

While Pod looks like a simple format, the specification calls for
a number of special cases to be handled, and that makes any software
that works on Pod as text more complex than it needs to be.  In
addition to this, Pod does not lend itself to a natural structured
model. This makes it difficult to manipulate without damaging the
validity of the document.

Pod::Abstract solves these problems by loading the document into a
structured tree, and providing consistent traversal, searching,
manpulation and re-serialisation. Pod related utilities are easy
to write using Pod::Abstract.

WWW: http://search.cpan.org/dist/Pod-Abstract/

PR:		ports/135181
Submitted by:	Cezary Morga <cm AT therek.net>
2009-06-20 19:06:24 +00:00
Martin Wilke
4fedf3e38f - Update to 1.0.1.12
- Connect py-markdown2 to the build

PR:		135247
Submitted by:	Wen Heping <wenheping@gmail.com> (maintainer)
2009-06-13 23:51:23 +00:00
Steven Kreuzer
53cba4f805 Atom takes in plain text with simple layout hints and outputs as HTML
or LaTeX.

PR:		ports/135424
Submitted by:	Chris Rees <chris@bayofrum.net>
Approved by:	wxs (mentor)
2009-06-12 13:40:55 +00:00
Martin Wilke
c6c35f862b This module supports multipage tables via the xtab and the longtable
package. For publication quality tables it utilizes the booktabs
package. It also supports the tabularx and tabulary packages for
nicer fixed-width tables. Furthermore, it supports the colortbl
package for colored tables optimized for presentations.  The powerful
new ctable package is supported and especially recommended when
footnotes are needed. LaTeX::Table ships with some predefined, good
looking themes.

WWW:	http://search.cpan.org/dist/LaTeX-Table/

PR:		ports/135243
Submitted by:	Wen Heping <wenheping at gmail.com>
2009-06-05 22:07:41 +00:00
Martin Wilke
679ef90516 The LaTeX::Driver module encapsulates the details of invoking the
Latex programs to format a LaTeX document. Formatting with LaTeX
is complicated; there are potentially many programs to run and the
output of those programs must be monitored to determine whether
further processing is required.

WWW:	http://search.cpan.org/dist/LaTeX-Driver/

PR:		ports/135170
Submitted by:	Wen Heping <wenheping at gmail.com>
2009-06-05 22:07:10 +00:00
Martin Wilke
1dfbbf579f p5::LaTeX::Encode provides a function to encode text that is to
be formatted with LaTeX. It encodes characters that are special
to LaTeX or that are represented in LaTeX by LaTeX commands.

WWW:	http://search.cpan.org/dist/LaTeX-Encode/

PR:		ports/135171
Submitted by:	Wen Heping <wenheping at gmail.com>
2009-06-05 22:05:50 +00:00
Martin Wilke
d041d194ab This module provides a parser which parses and interprets (though
not fully) LaTeX documents and returns a tree-based representation
of what it finds. This tree is a LaTeX::TOM::Tree. The tree contains
LaTeX::TOM::Node nodes.
WWW:	http://search.cpan.org/dist/LaTeX-TOM/

PR:		ports/135245
Submitted by:	Wen Heping <wenheping at gmail.com>
2009-06-05 22:03:53 +00:00
Martin Wilke
f128ddfb8e 2009-05-31 biology/p5-bioperl-devel: no longer under development
2009-05-31 biology/p5-bioperl-run-devel: no longer under development
2009-06-01 net-p2p/deluge05: use net-p2p/deluge instead
2009-06-03 textproc/gmat: failed to build for a long time, no maintainer and apparently no users either
2009-06-05 20:38:36 +00:00
Gabor Kovesdan
00bc58eac0 PDF-Table is intended for table generation using PDF::API2.
WWW:	http://search.cpan.org/~omega/

PR:		ports/134965
Submitted by:	scheidell at secnap.net
2009-06-05 17:19:43 +00:00
Boris Samorodov
c11516af7b Here are new Linux Fedora 10 infrastructure ports.
Those ports are intended to be used with 8-CURRENT at least
with SVN r192206.

If you want to switch to linux-f10 ports, please define at /etc/make.conf:
  OVERRIDE_LINUX_BASE_PORT=f10
  OVERRIDE_LINUX_NONBASE_PORTS=f10

An upgrading procedure is shown at /usr/ports/UPDATING, entries 20090401
and 20070327.

For the first time all tested linux ports work as expected(!):
. acroread8;
. google-earth;
. skype;
. seamonkey.

Many thanks for kernel folks who really did the main work
(and I wrote only some lines of ports).

There is a good chance that those ports may become a default
for 8.0-RELEASE. Please, test and report back to emulation@ ML.
2009-06-01 17:26:31 +00:00
Philip M. Gollucci
c6923fb5be * XPath support for document searching
* CSS3 selector support for document searching
* XML/HTML builder
* Drop in replacement for Hpricot (though not bug for bug)

Nokogiri parses and searches XML/HTML very quickly, and also has
correctly implemented CSS3 selector support as well as XPath support.

WWW:	http://nokogiri.rubyforge.org/nokogiri/

Submitted by:	Philip M. Gollucci <pgollucci at p6m7g8.com>
2009-05-18 21:01:16 +00:00
Martin Wilke
2905f00618 String::Escape - Registry of string functions, including backslash escapes
WWW: http://search.cpan.org/dist/String-Escape/

PR:		ports/134564
Submitted by:	Denis Pokataev <catone at cpan.org>
2009-05-17 13:06:10 +00:00
Martin Wilke
a5b74d6f13 - Connect p5-PDF-API3 to the build 2009-05-16 23:12:16 +00:00
Dennis Herrmann
dc9974675d - Update to 0.6.1052
- Connect to the build

PR:		ports/134304
Submitted by:	Alexey V.Degtyarev <alexey@renatasystems.org> (maintainer)
Approved by:	miwi (mentor)
2009-05-11 20:16:09 +00:00
Jose Alonso Cardenas Marquez
342ecab628 - New port: textproc/gambas2-gb-xml
The gambas XML component
2009-05-09 01:35:57 +00:00
Gerald Pfeifer
72259799a5 Remove textproc/sp per deprecation/expiration note. 2009-05-03 21:02:16 +00:00
Martin Wilke
4f5ceb6b92 XML::Feed::Deduper is deduper for XML::Feed.
You can write the aggregator more easily :)

WWW:	http://search.cpan.org/dist/XML-Feed-Deduper/

PR:		ports/134018
Submitted by:	TERAMOTO Masahiro <markun at onohara.to>
2009-05-01 11:09:10 +00:00
Steven Kreuzer
b4439e9824 Perl XS binding to libyaml which offers the best YAML support to date.
WWW:    http://search.cpan.org/~nuffin/YAML-LibYAML/

PR:     ports/134066
Submitted by:   TERAMOTO Masahiro <markun@onohara.to>
Approved by:    wxs (mentor)
2009-04-29 13:06:31 +00:00
Thierry Thomas
0c94c8b11c Add ky-aspell, aspell Kirghiz dictionary. 2009-04-28 19:32:57 +00:00
Martin Wilke
a26041c27f This is an implementation of John Gruber's Markdown text to html language.
There's not much here that differentiates it from any of the existing Markdown
implementations except that it's written in C instead of one of the vast flock
of scripting languages that are fighting it out for the Perl crown.

Markdown provides a library that gives you formatting functions suitable for
marking down entire documents or lines of text, a command-line program that you
can use to mark down documents interactively or from a script,
and a tiny (1 program so far) suite of example programs that show how to fully
utilize the markdown library.

WWW:		http://www.pell.portland.or.us/~orc/Code/markdown/

PR:		ports/134004
Submitted by:	Julien Laffaye <kimelto at gmail.com>
2009-04-26 23:46:43 +00:00
Max Brazhnikov
b8e407bcec Connect lexxia to build 2009-04-24 18:59:25 +00:00
Greg Larkin
db5b717622 docbook2X converts DocBook documents into man pages and Texinfo
documents.

It aims to support DocBook version 4.2, excepting the features that
cannot be supported or are not useful in a man page or Texinfo
document.

WWW:    http://docbook2x.sourceforge.net/
2009-04-24 03:09:37 +00:00
Martin Wilke
ba35815b20 This module is a blunt rewrite of XML::Simple (by Grant McLean)
to use the XML::LibXML parser for XML structures, where the original
uses plain Perl or SAX parsers.

WWW:	http://http://search.cpan.org/dist/XML-LibXML-Simple/

PR:		ports/133875
Submitted by:	TERAMOTO Masahiro <markun at onohara.to>
2009-04-23 09:02:29 +00:00
Philip M. Gollucci
77cb81877b Class XmlSimple offers an easy API to read and write XML. It is a Ruby
translation of Grant McLean's Perl module XML::Simple. Simply put,
it automatically converts XML documents into a Ruby Hash.

WWW: http://rubyforge.org/projects/xml-simple

Sponsored by:   RideCharge Inc.
2009-04-22 04:35:59 +00:00
Martin Wilke
67aa920afa This module is a tiny shim to unbreak XML files that have had their
entities doubly-encoded.

WWW:	http://search.cpan.org/dist/XML-DoubleEncodedEntities/

PR:		ports/133600
Submitted by:	Wen Heping <wenheping at gmail.com>
2009-04-11 22:35:42 +00:00
Boris Samorodov
824eda00fc Here are new Linux Fedora 8 infrastructure ports.
The recommended version of FreeBSD to use them is 8-CURRENT.
FreeBSD-7.x is not fully compatible with compat.linux.osrelease
2.6.16. Some syscalls cannot be MFCed due to native FreeBSD
ABI breakage.

Usage (and package building):
1. define compat.linux.osrelease=2.6.16;
2. add following variables to /etc/make.conf:
   . OVERRIDE_LINUX_BASE_PORT=f8;
   . OVERRIDE_LINUX_NONBASE_PORTS=f8.

Approved by:	bsam (me) ;-)
2009-04-01 15:25:39 +00:00
Sergey Skvortsov
1e23f938bd Add p5-Unicode-Escape 0.0.2, escape and unescape Unicode characters
other than ASCII.
2009-03-29 19:24:20 +00:00
Lars Balker Rasmussen
6837367be6 New port, textproc/p5-Text-Context:
Given a piece of text and some search terms, produces an object
which locates the search terms in the message, extracts a reasonable-length
string containing all the search terms, and optionally dumps the string out
as HTML text with the search terms highlighted in bold.
2009-03-29 14:26:51 +00:00
Pav Lucistnik
821d901072 Intelligent Input Bus for Linux / Unix OS, iBus, is the next generation input
framework, developed by the developer of scim-python, includes all its
function, and much more.

This is the base port, you need input methods such as pinyin to input text.

WWW: http://code.google.com/p/ibus

PR:		ports/128371
Submitted by:	Henry Hu <henry.hu.sh@gmail.com>
2009-03-26 09:24:35 +00:00
Martin Wilke
09d9113fd5 The mdocml utilities interface with the mdoc library to compile
mdoc-roff documents into a variety of output formats.

WWW: http://mdocml.bsd.lv/

PR:		ports/132449
Submitted by:	Ulrich Spoerlein <uspoerlein at gmail.com>
2009-03-25 17:48:31 +00:00
Chin-San Huang
8c500d5b12 Add p5-Unicode-EastAsianWidth, East Asian Width properties.
This module provide user-defined Unicode properties that deal with East
Asian characters' width status, as specified in
http://www.unicode.org/unicode/reports/tr11/.

WWW: http://search.cpan.org/dist/Unicode-EastAsianWidth/
2009-03-23 20:11:37 +00:00
Martin Wilke
1d298e91fb Solr is an open source enterprise search server based on the Lucene
Java search library, with XML/HTTP and JSON APIs, hit highlighting,
faceted search, caching, replication, a web administration interface
and many more features. It runs in a Java servlet container such as
Tomcat.

WWW:	http://lucene.apache.org/solr/

PR:		ports/132665
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2009-03-22 13:47:54 +00:00
Martin Wilke
12c0025123 p5-XML-Bare is a minimal XML parser, schema checker and pretty-printer
using C internally.

WWW: http://search.cpan.org/dist/XML-Bare/

PR:		ports/132731
Submitted by:	Wen Heping <wenheping at gmail.com>
2009-03-19 21:36:13 +00:00
Peter Pentchev
be3a678fba Add a port of confget-1.01, a simple tool to read variables from
configuration files and display their values in a way suitable for
use in shell scripts.
2009-03-18 11:12:04 +00:00
Thierry Thomas
5866f3a92b Aspell Ancient Greek dictionary.
http://www.himeros.eu/
2009-03-17 18:05:42 +00:00
Martin Wilke
abb5037267 PyStemmer provides access to efficient algorithms for calculating a
"stemmed" form of a word. This is a form with most of the common
morphological endings removed; hopefully representing a common
linguistic base form. This is most useful in building search engines
and information retrieval software; for example, a search with stemming
enabled should be able to find a document containing "cycling" given the
query "cycles".

PyStemmer provides algorithms for several (mainly european) languages,
by wrapping the libstemmer library from the Snowball project in a Python
module. It also provides access to the classic Porter stemming algorithm
for english: although this has been superceded by an improved algorithm,
the original algorithm may be of interest to information retrieval
researchers wishing to reproduce results of earlier experiments.

WWW:	http://pypi.python.org/pypi/PyStemmer/

PR:		ports/132695
Submitted by:	Wen Heping <wenheping at gmail.com>
2009-03-16 21:47:30 +00:00
Martin Wilke
419a983bfb This module provides a set of methods for adding (updating) and
deleting entries in an existing solr server.

WWW:	http://search.cpan.org/dist/Solr/

PR:		ports/132666
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2009-03-15 19:41:56 +00:00
Martin Wilke
aaad10ab48 This is an experimental distribution that attempts to intuit the underlying
indent "policy" for a text file (most likely a source code file).

WWW: http://search.cpan.org/dist/Text-FindIndent/

PR:		ports/132406
Submitted by:	Cezary Morga <cm at therek.net>
2009-03-15 17:17:04 +00:00
Martin Wilke
e2307bb6cc This extension provides WBXML (Wireless Binary XML) conversion capabilities
using the libwbxml library, which can be found at https://libwbxml.opensync.org/

WWW:	http://pecl.php.net/package/WBXML/

PR:		ports/132552
Submitted by:	Florian Smeets <flo at kasimir.com>
2009-03-12 16:41:31 +00:00
Martin Wilke
27f11aeb62 FormValidator::Simple like lightweight form validation library.
WWW:	http://search.cpan.org/dist/FormValidator-Lite/

PR:		ports/132444
Submitted by:	TAKAHASHI Kaoru <kaoru at kaisei.org>
2009-03-11 16:43:37 +00:00
Gerald Pfeifer
331bf95828 Remove textproc/latte and textproc/nicetext per expiration note. 2009-03-07 19:01:07 +00:00
Martin Wilke
07e4c39639 Tempita is a small templating language for text substitution.
This isn't meant to be the Next Big Thing in templating; it's
just a handy little templating language for when your project
outgrows string.Template or % substitution. It's small, it
embeds Python in strings, and it doesn't do much else.

WWW:	http://pythonpaste.org/tempita/

Submitted by:	Wen Heping <wenheping at gmail.com>
2009-02-27 15:33:28 +00:00
Lars Thegler
71e03e5714 Add p5-Text-Truncate 1.03, perl module with simple string truncating
routine.
2009-02-25 09:53:20 +00:00
Martin Wilke
c04a70f4be EBook-Tools contains a library and a command-line tool for unpacking,
creating, correcting, and repacking electronic books.

Current unpacking support is limited to PalmDoc and Mobipocket and
generation is limited to EPub.  The metadata correction tools are
quite extensive, however.  For more details, see the POD information
on EBook::Tools and EBook::Tools::Unpack.

WWW: http://search.cpan.org/dist/EBook-Tools/

PR:		ports/131901
Submitted by:	simond at irrelevant.org
2009-02-23 20:58:38 +00:00
Max Brazhnikov
2d8b5de9c8 Update PyQt4 ports to 4.4.4, new ports:
devel/py-qt4-help
multimedia/py-qt4-phonon
textproc/py-qt4-xmlpatterns
www/py-qt4-webkit

Update QScintilla2 to 2.3.2, PyQt3 to 3.17.6, PyKDE3 to 3.16.2.

Pass maintainership to kde@FreeBSD.org. Thanks Danny Ricin for his great work.

PR:		based on ports/130219
Submitted by:	Dima Panov" <fluffy at fluffy.khv.ru>
2009-02-14 22:58:55 +00:00
Anton Berezin
82c2a84b85 Add textproc/p5-String-BufferStack 1.12, a Perl module that provides
nested buffers for templating systems.
2009-02-10 12:11:31 +00:00
Martin Wilke
f35b39863a Minimalistic cmdline tool to fetch RSS news from HTTP or local files and
convert them into plain text, formatted html, or token-separated strings.
This tool can be easily integrated into shellscripts, cron tasks, motd's,
etc.

WWW:	http://nopcode.org/blog/rss2html.html

PR:		ports/131266
Submitted by:	Dennis Herrmann <adox at mcx2.org>
2009-02-09 17:48:17 +00:00
Li-Wen Hsu
f1760a6de0 Add tokyodystopia 0.9.9, a full-text search system based on Tokyo
Cabinet.
2009-02-04 14:10:32 +00:00
Rong-En Fan
5920a5c3f1 - Add liblinebreak 1.0
Liblinebreak is an implementation of the line breaking algorithm as
  described in Unicode 5.1.0 Standard Annex 14, Revision 22. It breaks
  lines that contain Unicode characters. It is designed to be used in a
  generic text renderer. FBReader is one real-world example.

  WWW:	http://vimgadgets.sourceforge.net/liblinebreak/

PR:		ports/130949
Submitted by:	Yuri Pankov <yuri.pankov at gmail.com>
2009-01-26 16:50:45 +00:00
Gabor Pali
3643bd0963 A variety of alternative parser combinator libraries for the functional
programming language Haskell, including the original HuttonMeijer set.
The Poly sets have features like good error reporting, arbitrary token
type, running state, lazy parsing, and so on.  Finally, Text.Parse is a
proposed replacement for the standard Read class, for better
deserialisation of Haskell values from Strings.

WWW: http://www.cs.york.ac.uk/fp/polyparse/

Approved by:	gabor
2009-01-23 23:53:56 +00:00
Gabor Kovesdan
56af130059 uml2svg is an XSLT-based tool for converting UML Diagrams to SVG.
WWW: http://uml2svg.sourceforge.net/
2009-01-22 13:05:09 +00:00
Jun Kuriyama
64381c774a Add p5-XML-OPML-LibXML 0.04, perl extension to parse OPML document with
LibXML parser.
2009-01-21 13:23:19 +00:00
Cheng-Lung Sung
d6261f64db - replaced by textproc/nltk 2009-01-21 07:03:05 +00:00
Pav Lucistnik
d8c3357913 - Remove textproc/py-wbxml few days early before it's expiration time - latest
update to wbxml broke it
2009-01-19 18:13:21 +00:00
Philip M. Gollucci
2ccbebaf07 This package builds a docset from sources in different formats. The
generated documents can be all nicely interlinked and to have the same
look and feel.

Currently it knows to handle input formats:

 * POD * HTML

and knows to generate:

 * HTML * PS * PDF

WWW: http://search.cpan.org/~stas/DocSet
2009-01-15 03:26:08 +00:00
Sergey Matveychuk
8b5e622730 Expired port:
2008-12-11 textproc/p5-Text-CSV_PP: textproc/Text-CSV_PP was merged into textproc/p5-Text-CSV
2009-01-14 18:46:06 +00:00
Martin Wilke
81916b6f5e Escapes XML entities from text, so that you don't fall prey to people putting
quotes, less-than/greater-than, and ampersands, into variables that end up in
TT templates.

WWW: http://search.cpan.org/dist/Template-Plugin-XML-Escape

PR:		ports/130422
Submitted by:	kevit
2009-01-14 09:17:17 +00:00
Jose Alonso Cardenas Marquez
79e3add23a - New port: textproc/fpc-aspell
Interface to aspell spelling checker
2009-01-14 02:41:00 +00:00
Martin Wilke
dd52033209 p5-XML-Compile-Cache is a perl module to cache compiled
XML translators.

WWW:	http://search.cpan.org/dist/XML-Compile-Cache/

PR:		ports/130369
Submitted by:	Wen Heping <wenheping at gmail.com>
2009-01-11 14:02:04 +00:00
Martin Wilke
7854dcaffb The program m17n-im-config provides a GUI for per-user configuration of
input methods of the m17n library (devel/m17n-lib and textproc/m17n-contrib).

The following methods are customisable at the moment: Unicode, Vietnamese,
Malayalam, Tibetan, Thai, Japanese and Chinese.

WWW: http://www.m17n.org
     http://www.m17n.org/common/im-config/index.html

PR:		ports/127893
Submitted by:	Nikola Lecic <nikola.lecic at anthesphoria.net>
2009-01-11 11:14:22 +00:00
Martin Wilke
6f312588cf This port installs the user-contributed input methods for the m17n library
(the official ones are installed through devel/m17n-lib). It currently
supports Punjabi, Sinhala, Telugu, Nepali, Russian, Assamese, Bengali,
Gujarati, Hindi, Kannada, Malayalam, Marathi, Oriya and Vietnamese layouts
with various sub-variants.

The port also provides the tbl2mim.awk script for conversion of keyboard
files used by textproc/scim-table-imengine into SCIM-independent .mim format
(usable by m17n library).

WWW: http://www.m17n.org/

PR:		ports/127894
Submitted by:	Nikola Lecic <nikola.lecic at anthesphoria.net>
2009-01-11 11:13:34 +00:00
Beech Rintoul
f652dc8619 Dikt is a dictionary application for KDE4. It is the client
side implementation of Dict protocol, with a browser interface
specialized for querying dictionaries.

WWW: http://diktv1.googlepages.com/

PR:		ports/130000
Submitted by:	Goran Tal <goran.tal at gmail.com>
2008-12-30 22:54:03 +00:00
Martin Wilke
a508c630d7 Markaby is a templating language for Ruby, with a plugin for Rails,
which allows you to write HTML templates in pure-Ruby (a la Builder.)

WWW:	http://markaby.rubyforge.org/
2008-12-28 16:08:38 +00:00
Gabor Kovesdan
578fbaf4bb Europass is an XML Schema to create CVs and Language Passports conforming
the European conventions.

WWW: http://europass.cedefop.europa.eu/
2008-12-26 03:04:41 +00:00
Sergey Skvortsov
0f166c5db1 Add p5-Text-Diff-Parser 0.08, parse patch files containing unified and
standard diffs.
2008-12-24 17:55:01 +00:00
Gabor Pali
7f81ee1e51 highlighting-kate is a syntax highlighting library for Haskell with
support for over 50 languages. The syntax parsers are automatically
generated from Kate syntax descriptions [1], so any syntax supported by
Kate can be added. An (optional) command-line program is provided,
along with a utility for generating new parsers from Kate XML syntax
descriptions.

Currently the following languages are supported: Ada, Asp, Awk, Bash,
Bibtex, C, Cmake, Coldfusion, Commonlisp, Cpp, Css, D, Djangotemplate,
Doxygen, Dtd, Eiffel, Erlang, Fortran, Haskell, Html, Java, Javadoc,
Javascript, Json, Latex, Lex, LiterateHaskell, Lua, Makefile, Matlab,
Mediawiki, Modula3, Nasm, Objectivec, Ocaml, Pascal, Perl, Php,
Postscript, Prolog, Python, Rhtml, Ruby, Scala, Scheme, Sgml, Sql,
SqlMysql, SqlPostgresql, Tcl, Texinfo, Xml, Xslt, Yacc.

WWW: http://johnmacfarlane.net/highlighting-kate

[1] http://kate-editor.org

PR:		ports/129690
Submitted by:	pgj
Approved by:	miwi
2008-12-21 12:02:56 +00:00
Martin Wilke
c2e6ce64f2 - Remove duplicate port of devel/hs-utf8-string-ghc
Submitted by:	Samy Al Bahra <sbahra at kerneled.org> (via irc)
		pav
2008-12-19 06:16:10 +00:00
Beech Rintoul
0d00158bd6 - Hook rubygem-spreadsheet into build
PR:		ports/129584
2008-12-15 05:44:45 +00:00
Martin Wilke
da977380ba 2008-09-19 japanese/escpf: Has been broken for more than 6 months
2008-09-19 java/java-gcj-compat: Has been broken for more than 6 months
2008-09-19 lang/screamer: Has been broken for more than 6 months
2008-10-01 misc/documancer: Unmaintained upstream
2008-09-19 misc/ipbt: Has been broken for more than 6 months
2008-10-13 multimedia/manslide: Use multimedia/smile instead
2008-09-19 net/globus4: Has been broken for more than 6 months
2008-09-19 net/p5-Parallel-MPI: Has been broken for more than 6 months
2008-01-28 net/p54u: website disappeared
2008-09-19 net-im/ginsu: Has been broken for more than 6 months
2008-09-19 net-p2p/py-kenosis-bittorrent: Has been broken for more than 6 months
2008-09-19 sysutils/sjog: Has been broken for more than 6 months
2008-09-19 textproc/Ebnf2ps: Has been broken for more than 6 months
2008-09-19 www/roxen: Has been broken for more than 6 months
2008-09-19 x11-fm/evidence: Has been broken for more than 6 months
2008-12-07 20:36:48 +00:00
Martin Wilke
4bc9da11f9 A UTF8 layer for IO, ByteStrings and Strings. The utf8-string package provides
operations for encoding UTF8 strings to Word8 lists and back, and for reading
and writing UTF8 without truncation.

WWW: http://github.com/glguy/utf8-string/

PR:		ports/129427
Submitted by:	Samy Al Bahra <sbahra at kerneled.org>
2008-12-06 15:33:01 +00:00
Martin Wilke
7fd3acb105 A small library that allows you to parse CSV formatted data efficiently
with Haskell.

WWW: http://code.haskell.org/~dons/code/bytestring-csv

PR:		ports/129162
Submitted by:	Samy Al Bahra <sbahra at kerneled.org>
2008-11-29 21:44:00 +00:00
Martin Wilke
20245525d2 This module is built on Text::Aspell, but adds some of the
functionality provided by the internal gnu aspell API. This allows
one to deal with blocks of text, rather than just words. For
instance, we provide methods for iterating through the text,
serializing the object (thus remembering where we left off), and
highlighting the current misspelled word within the text.

WWW: http://search.cpan.org/dist/Text-SpellChecker/
2008-11-23 16:35:26 +00:00
Hiroki Sato
053280bd76 Add textproc/linux-scim-gtk, Linux binary version of an SCIM module
for gtk library, separated from textproc/linux-scim-libs.
2008-11-21 15:45:58 +00:00
Martin Wilke
14e392f075 A program which can be used to converts a single HTML file or a collection
of related HTML files into a single LaTeX file.

WWW: http://www.iwriteiam.nl/html2tex.html

PR:		ports/128631
Submitted by:	martin.dieringer at gmx.de
2008-11-13 21:37:58 +00:00
Martin Wilke
b527b77010 Ruby wrapper around David Loren Parsons' discount, a fast,
BSD-licensed c implementation of John Gruber's Markdown plus
some aspects of SmartyPants.  Markdown is a text-to-HTML
conversion language for web writers, inspired by the format
of plain-text e-mail messages.  Markdown allows you to write
in an easy-to-read, easy-to-write plain text format, then
convert it to structurally valid XHTML (or HTML).

WWW: http://github.com/rtomayko/rdiscount/

PR:		ports/128548
Submitted by:	Daniel Roethlisberger <daniel at roe.ch>
2008-11-09 23:02:01 +00:00
Martin Wilke
00031d4dc5 Diff::LCS is a port of Algorithm::Diff that uses the McIlroy-Hunt longest
common subsequence (LCS) algorithm to compute intelligent differences between
two sequenced enumerable containers.

WWW: http://rubyforge.org/projects/ruwiki/

PR:		ports/128591
Submitted by:	Daniel Roethlisberger <daniel at roe.ch>
2008-11-09 23:00:51 +00:00
Jeremy Chadwick
6c8915f368 This library provides Boyer-Moore and Knuth-Morris-Pratt string search
implementations for Haskell bytestrings (with support for all strict and
lazy ByteString permutations).

WWW: http://hackage.haskell.org/cgi-bin/hackage-scripts/package/stringsearch

Submitted by:	sbahra on EFnet #bsdports
2008-11-02 01:32:14 +00:00
Sergey Skvortsov
2c9117b75e Add p5-XML-Atom-Ext-OpenSearch 0.01, XML::Atom extension for OpenSearch
data.
2008-10-27 11:42:47 +00:00
Martin Wilke
b10df17072 Kwalify is a parser, schema validator, and data binding tool for
YAML and JSON.
This package provides a Perl 5 implementation of Kwalify.

WWW: http://search.cpan.org/dist/Kwalify/
2008-10-24 09:35:07 +00:00
Lars Thegler
f60dcf4170 Add p5-XML-Rules 1.05, parse XML and specify rules for individual tags. 2008-10-14 11:52:13 +00:00
Martin Wilke
4488490785 - Connect textproc/xerces-c2-devel 2008-10-12 11:13:31 +00:00
Stanislav Sedov
1c98d18014 - Add textproc/ruby-liquid, a non evaling template engine for ruby. 2008-10-06 14:24:47 +00:00
Lars Thegler
d72c21f098 Add p5-pod2pdf 0.42, converts POD to PDF format. 2008-10-02 11:51:06 +00:00
Martin Wilke
a022fa80ec Amberfish is general purpose text retrieval software, developed at Etymon
by Nassib Nassar and distributed as open source software under the terms
of version 2 of the GNU General Public License (GPL). Its distinguishing
features are indexing/search of semi-structured text (i.e. both free tex
and multiply nested fields), built-in support for XML documents using the
Xerces library, structured queries allowing generalized field/tag paths,
hierarchical result sets (XML only), automatic searching across multiple
databases (allowing modular indexing), TREC format results, efficient
indexing, and relatively low memory requirements during indexing (and the
ability to index documents larger than available memory). Z39.50 support
is available. Other features include Boolean queries, right truncation,
phrase searching, relevance ranking, support for multiple documents per
file, incremental indexing, and easy integration with other UNIX tools,
The architecture is also designed to permit proximity queries; however,
they are not fully implemented at present.

WWW: http://www.etymon.com/tr.html

This port also includes the Porter stemming algorithm for suffix
stripping, available at:
     http://www.tartarus.org/~martin/PorterStemmer

PR:		ports/127580
Submitted by:	Pedro Giffuni
2008-09-30 14:19:57 +00:00
Martin Wilke
95ee2714b0 Text::WikiCreole implements the Wiki Creole markup language, version
1.0, as described at http://www.wikicreole.org.  It reads Creole 1.0
markup and returns XHTML.

In addition to the official Creole 1.0 markup elements, it also supports
several extensions, such as plugins, superscript, subscript, underline,
definition lists, indented paragraphs, plugins, etc.

WWW: http://search.cpan.org/dist/Text-WikiCreole/

PR:		ports/127705
Submitted by:	Matthew Fuller <fullermd at over-yonder.net>
2008-09-30 08:48:16 +00:00
Max Brazhnikov
9445c3e0fc Add textproc/scim-bridge and textproc/scim-bridge-qt4:
Scim-bridge is wrapper library for SCIM, written in C. Scim-bridge
is seperated in two parts, the agent and the clients. The agent is
the IME server which communicates with SCIM. The clients are IMModules
which communicate only with the agent so that there is no binary
dependency between the clients and SCIM.

WWW:	http://www.scim-im.org/projects/scim_bridge/

PR:		126603
Submitted by:	Henry Hu <henry.hu.sh at gmail.com>
Approved by:	miwi (mentor)
2008-09-23 15:16:58 +00:00
Edwin Groothuis
cabd6ca967 [new port] Add textproc/stardict3
Add new port textproc/stardict3 (update stardict-2.x to
	stardict-3.x).

	StarDict is a Cross-Platform and international dictionary
	written in Gtk2.  It has powerful features such as "Glob-style
	pattern matching," "Scan selection word," "Fuzzy query,"
	etc.

	WWW: http://stardict.sourceforge.net/

PR:		ports/125924
Submitted by:	"Eric L. Chen" <d9364104@mail.nchu.edu.tw>
2008-09-23 02:35:09 +00:00
Li-Wen Hsu
b652cfa177 Add py-sphinx 0.4.2, python documentation generator. 2008-09-07 19:38:48 +00:00
Chin-San Huang
d2d421fd99 Add p5-Text-CharWidth, get number of occupied columns of a string on terminal.
This module supplies features similar as wcwidth(3) and wcswidth(3) in C
language.

Characters have its own width on terminal depending on locale. For example,
ASCII characters occupy one column per character, east Asian fullwidth
characters (like Hiragana or Han Ideograph) occupy two columns per
character, and combining characters (apperaring in ISO-8859-11 Thai,
Unicode, and so on) occupy zero columns per character. mbwidth() gives the
width of the first character of the given string and mbswidth() gives the
width of the whole given string.

The names of mbwidth and mbswidth came from "multibyte" versions of wcwidth
and wcswidth which are "wide character" versions.

mblen(string) returns number of bytes of the first character of the string.
Please note that a character may consist of multiple bytes in multibyte
encodings such as UTF-8, EUC-JP, EUC-KR, GB2312, or Big5.

mbwidth(string) returns the width of the first character of the string.
mbswidth(string) returns the width of the whole string.

Parameters are to be given in locale encodings, not always in UTF-8.

WWW: http://search.cpan.org/dist/Text-CharWidth/
2008-08-31 08:31:02 +00:00
Chin-San Huang
7c5c013873 Add p5-Text-Flow, flexible text flowing and word wrapping for not just ASCII
output.

This module provides a flexible way to wrap and flow text for both ASCII and
non-ASCII outputs.

The main purpose of this module is to provide text wrapping and flowing
features without being tied down to ASCII based output and fixed-width
fonts. My needs were for a more sophisticated text control in PDF and GIF
output formats in particular.

WWW: http://search.cpan.org/dist/Text-Flow/
2008-08-30 17:16:29 +00:00
Cheng-Lung Sung
e2f9df4db8 - Perl extension for manipulating the Penn Treebank format 2008-08-27 03:04:09 +00:00
Pietro Cerutti
9aac2f9850 New port: es-ispell
Spanish dictionary for the ispell spell-checking program.

PR:		125838
Submitted by:	Pedro F. Giffuni <giffunip@tutopia.com>
2008-08-19 14:55:26 +00:00