[FEATURE] Get a file's text content if possible 56/36556/9
authorIngo Renner <ingo@typo3.org>
Sun, 1 Feb 2015 06:53:22 +0000 (22:53 -0800)
committerStefan Froemken <froemken@gmail.com>
Fri, 13 Mar 2015 08:42:08 +0000 (09:42 +0100)
commitca87592e8c95a1b45d57dd9eab2ceb051dce88d9
treea283c7e8f04887732183286cde9b05906a1c5cd9
parent354beeab7074baa0c4eb2b4f30680ac93de078ef
[FEATURE] Get a file's text content if possible

Currently FAL only allows to extract meta data from files. This
patch allows to also extract text content from files. This can
be useful for search engines or providing snippets/teasers
in document archives.

Multiple text extractors can be registered to allow dealing with
different file types. A plain text extractor is provided by the core.

This is also a successor to the former textExtract service interface
implemented by several extensions: http://bit.ly/1D0x92M

Fixes: #36743
Releases: master
Change-Id: I1ce414c99fb26413eedd32422821e1a8802010de
Reviewed-on: http://review.typo3.org/36556
Reviewed-by: Frans Saris <franssaris@gmail.com>
Tested-by: Frans Saris <franssaris@gmail.com>
Reviewed-by: Frank Nägler <typo3@naegler.net>
Tested-by: Frank Nägler <typo3@naegler.net>
Reviewed-by: Stefan Froemken <froemken@gmail.com>
Tested-by: Stefan Froemken <froemken@gmail.com>
typo3/sysext/core/Classes/Resource/TextExtraction/PlainTextExtractor.php [new file with mode: 0644]
typo3/sysext/core/Classes/Resource/TextExtraction/TextExtractorInterface.php [new file with mode: 0644]
typo3/sysext/core/Classes/Resource/TextExtraction/TextExtractorRegistry.php [new file with mode: 0644]
typo3/sysext/core/Documentation/Changelog/master/Feature-36743-FAL-TextExtractorRegistry.rst [new file with mode: 0644]
typo3/sysext/core/Tests/Unit/Resource/TextExtraction/PlainTextExtractorTest.php [new file with mode: 0644]
typo3/sysext/core/Tests/Unit/Resource/TextExtraction/TextExtractorRegistryTest.php [new file with mode: 0644]
typo3/sysext/core/ext_localconf.php