Add textproc/py-RTFDE: Library for extracting HTML content from RTF encapsulated HTML

RTFDE is a python3 library for extracting encapsulated HTML & plain
text content from the RTF bodies of .msg files.

De-encapsulation enables previously encapsulated HTML and plain
text content to be extracted and rendered as HTML and plain text
instead of the encapsulating RTF content. After de-encapsulation,
the HTML and plain text should differ only minimally from the
original HTML or plain text content.

WWW: https://github.com/seamustuohy/RTFDE

PR:		265764
This commit is contained in:
Jesús Daniel Colmenares Oviedo 2022-09-23 13:54:09 -04:00 committed by Li-Wen Hsu
parent 54b158cbe6
commit f6c58e8709
No known key found for this signature in database
GPG key ID: 82B261B14D3BC7AF
4 changed files with 37 additions and 0 deletions

View file

@ -1272,6 +1272,7 @@
SUBDIR += py-CommonMark
SUBDIR += py-Morfessor
SUBDIR += py-QDarkStyle
SUBDIR += py-RTFDE
SUBDIR += py-Tempita
SUBDIR += py-acora
SUBDIR += py-aeidon

View file

@ -0,0 +1,25 @@
PORTNAME= RTFDE
PORTVERSION= 0.0.2
CATEGORIES= textproc python
MASTER_SITES= CHEESESHOP
PKGNAMEPREFIX= ${PYTHON_PKGNAMEPREFIX}
MAINTAINER= DtxdF@disroot.org
COMMENT= Library for extracting HTML content from RTF encapsulated HTML
WWW= https://github.com/seamustuohy/RTFDE
LICENSE= LGPL3
RUN_DEPENDS= ${PYTHON_PKGNAMEPREFIX}lark>=0.11:devel/py-lark@${PY_FLAVOR} \
${PYTHON_PKGNAMEPREFIX}oletools>=0.56:devel/py-oletools@${PY_FLAVOR}
USES= python:3.6+
USE_PYTHON= autoplist distutils
OPTIONS_DEFINE= LXML
LXML_DESC= Pythonic binding for the libxml2 and libxslt libraries
LXML_RUN_DEPENDS= ${PYTHON_PKGNAMEPREFIX}lxml>=4.6:devel/py-lxml@${PY_FLAVOR}
.include <bsd.port.mk>

View file

@ -0,0 +1,3 @@
TIMESTAMP = 1659984075
SHA256 (RTFDE-0.0.2.tar.gz) = b86b5d734950fe8745a5b89133f50554252dbd67c6d1b9265e23ee140e7ea8a2
SIZE (RTFDE-0.0.2.tar.gz) = 18891

View file

@ -0,0 +1,8 @@
RTFDE is a python3 library for extracting encapsulated HTML & plain
text content from the RTF bodies of .msg files.
De-encapsulation enables previously encapsulated HTML and plain
text content to be extracted and rendered as HTML and plain text
instead of the encapsulating RTF content. After de-encapsulation,
the HTML and plain text should differ only minimally from the
original HTML or plain text content.