[FEATURE] Get a file's text content if possible 56/36556/9
author Ingo Renner <ingo@typo3.org>
Sun, 1 Feb 2015 06:53:22 +0000 (22:53 -0800)
committer Stefan Froemken <froemken@gmail.com>
Fri, 13 Mar 2015 08:42:08 +0000 (09:42 +0100)

Currently FAL only supports extracting metadata from files. This
patch adds support for extracting text content as well. This can
be useful for search engines or for providing snippets/teasers
in document archives.

Multiple text extractors can be registered to handle different
file types. A plain text extractor is provided by the core.
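
The registration idea can be illustrated with a minimal, language-neutral
sketch. It is written in Python for brevity and is not the PHP code added by
this patch. All names below (TextExtractor, can_extract_text, extract_text,
register, get_extractor, get_text_content) and the ".txt" suffix check are
assumptions made for illustration only.

```python
from typing import List, Optional, Protocol


class TextExtractor(Protocol):
    """Hypothetical contract: report whether a file is supported, then extract."""

    def can_extract_text(self, path: str) -> bool: ...

    def extract_text(self, path: str) -> str: ...


class PlainTextExtractor:
    """Handles plain text files, detected here by a ".txt" suffix (an assumption)."""

    def can_extract_text(self, path: str) -> bool:
        return path.lower().endswith(".txt")

    def extract_text(self, path: str) -> str:
        with open(path, encoding="utf-8") as handle:
            return handle.read()


class TextExtractorRegistry:
    """Holds registered extractors and returns the first one that accepts a file."""

    def __init__(self) -> None:
        self._extractors: List[TextExtractor] = []

    def register(self, extractor: TextExtractor) -> None:
        self._extractors.append(extractor)

    def get_extractor(self, path: str) -> Optional[TextExtractor]:
        for extractor in self._extractors:
            if extractor.can_extract_text(path):
                return extractor
        return None


def get_text_content(registry: TextExtractorRegistry, path: str) -> Optional[str]:
    """Return the file's text if some extractor supports it, otherwise None."""
    extractor = registry.get_extractor(path)
    return extractor.extract_text(path) if extractor is not None else None
```

In this sketch a caller registers one extractor per supported file type and
treats a None result as "no text available", which mirrors the "if possible"
wording of the subject line.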

This is also a successor to the former textExtract service interface
implemented by several extensions: http://bit.ly/1D0x92M

Fixes: #36743
Releases: master
Change-Id: I1ce414c99fb26413eedd32422821e1a8802010de
Reviewed-on: http://review.typo3.org/36556
Reviewed-by: Frans Saris <franssaris@gmail.com>
Tested-by: Frans Saris <franssaris@gmail.com>
Reviewed-by: Frank Nägler <typo3@naegler.net>
Tested-by: Frank Nägler <typo3@naegler.net>
Reviewed-by: Stefan Froemken <froemken@gmail.com>
Tested-by: Stefan Froemken <froemken@gmail.com>
typo3/sysext/core/Classes/Resource/TextExtraction/PlainTextExtractor.php [new file with mode: 0644]
typo3/sysext/core/Classes/Resource/TextExtraction/TextExtractorInterface.php [new file with mode: 0644]
typo3/sysext/core/Classes/Resource/TextExtraction/TextExtractorRegistry.php [new file with mode: 0644]
typo3/sysext/core/Documentation/Changelog/master/Feature-36743-FAL-TextExtractorRegistry.rst [new file with mode: 0644]
typo3/sysext/core/Tests/Unit/Resource/TextExtraction/PlainTextExtractorTest.php [new file with mode: 0644]
typo3/sysext/core/Tests/Unit/Resource/TextExtraction/TextExtractorRegistryTest.php [new file with mode: 0644]