Indexed Search modifications for support of cronjob based indexing. More to come...
***************
TODO / projects:
***************

Version 3:
- Refactor "class.crawler.php"
- Add timing controls
- Hook + example for external clients
- IndexCfg option:
- Exclude URLs (regex?), possibly with a graphical interface for selecting them from indexed content.
- Record-indexing: support languageField in records
- Crawling at night (enter a time for indexing configurations), but possible to force.
- New type: Page Tree (traversing a part at night like crawler allows us...)
- Indexing configuration overview (including status and manual clearing possibility)
- Set up test environment with CLI running
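
Sketch for the "Exclude URLs (regex?)" and "Crawling in the night ... possible to force" items above. Python is used for illustration only (the crawler itself is PHP). The names is_excluded, filter_urls and in_crawl_window, and the sample patterns, are made up for this sketch and are not part of indexed_search.

```python
import re
from datetime import time

# Hypothetical exclude patterns, as they might be entered in an indexing configuration.
EXCLUDE_PATTERNS = [
    r"/print/",          # printer-friendly versions
    r"[?&]no_cache=1",   # uncached variants
    r"\.(zip|exe)$",     # binary downloads
]

_COMPILED = [re.compile(p) for p in EXCLUDE_PATTERNS]


def is_excluded(url: str) -> bool:
    """Return True if any exclude pattern matches somewhere in the URL."""
    return any(rx.search(url) for rx in _COMPILED)


def filter_urls(urls):
    """Keep only the URLs the crawler should still visit."""
    return [u for u in urls if not is_excluded(u)]


def in_crawl_window(now: time, start: time, end: time, force: bool = False) -> bool:
    """Decide whether crawling may run at clock time `now`.

    The window is [start, end) and may wrap past midnight (e.g. 22:00 to 06:00).
    `force` overrides the window, matching the "possible to force" note.
    """
    if force:
        return True
    if start <= end:
        return start <= now < end
    # Window wraps around midnight.
    return now >= start or now < end
```

A cron-driven run could call in_crawl_window() first and then pass the queued URLs through filter_urls() before fetching them.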

Frontend:
- Searching in a certain external site
- Mac "Spotlight"-like searching? (Define which categories are bundled and which are not)
- Alternative presentations for when records are indexed.

Testing indexing crawler for:
- 3DS
- TYPO3.org copy
- Metropol
- FI
- Link Factory
- Brunata

**************

Bugs / Issues:
- The "No Search" checkbox in the page header is only respected by indexed_search during indexing (a page is not indexed while "No Search" is set). Search results, however, are not filtered by this flag, so a page that was indexed before the flag was set will still be found. Changing this is hard: the getTreeList() function that fetches all page ids cannot take a where-clause to filter such pages out, so it would need hardcoded support. Alternatively, the pages table must be joined into the search query so we can select on the field. A solution has not been agreed upon yet.
- For tt_news with access restricted records: don't show the title of the page since it can reveal information
  - SOLUTIONS: Maybe just hide search results where "resume" is normally not shown?
- When there is a page where *content* is access restricted (e.g. from a plugin) while the page itself is not, TYPO3 will still display it as a search result (not the description, of course, but the title will be revealed). There should be a flag that the plugin can set so the indexer knows that the page as a whole should be indexed as if it were completely access restricted. Or maybe _all_ search results which are NOT indexed under "0,-1" should be hidden? (Reported by Lars Houmark <lars@houmark.com>)
- External media / languages seem to be implemented in a buggy way. See mail from Gert Thiel <GertThiel@gmx.net>, 24/2 2005
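
Sketch of the "join the pages table" alternative from the "No Search" issue above. It uses Python with an in-memory SQLite database, for illustration only. The schema is a reduced stand-in: only the columns needed here (pages.uid, pages.no_search, index_section.phash, index_section.page_id) are assumed, and the helper name search_visible_pages is hypothetical.

```python
import sqlite3


def search_visible_pages(conn, phashes):
    """Return page ids for the given phash values, skipping pages flagged no_search."""
    if not phashes:
        return []
    placeholders = ",".join("?" for _ in phashes)
    sql = (
        "SELECT DISTINCT s.page_id "
        "FROM index_section AS s "
        "JOIN pages AS p ON p.uid = s.page_id "
        f"WHERE s.phash IN ({placeholders}) AND p.no_search = 0 "
        "ORDER BY s.page_id"
    )
    return [row[0] for row in conn.execute(sql, list(phashes))]


# Tiny demo data set: page 2 was indexed first and flagged "No Search" afterwards.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE pages (uid INTEGER PRIMARY KEY, no_search INTEGER NOT NULL DEFAULT 0);
CREATE TABLE index_section (phash INTEGER, page_id INTEGER);
INSERT INTO pages (uid, no_search) VALUES (1, 0), (2, 1), (3, 0);
INSERT INTO index_section (phash, page_id) VALUES (100, 1), (200, 2), (300, 3);
""")

visible = search_visible_pages(conn, [100, 200, 300])
```

With the join in place the flag is evaluated at search time, so a page flagged after it was indexed drops out of the results without re-indexing.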

Errors encountered after spidering, maybe check:
- testsite: "message appears" - showed external media that was NOT indexed!?
- typo3site_live: Warning: phash-row "114682730" didn't have a representation in the index_section table! on references page!

Search test:
- external media respect privacy of pages?
- external media on multiple pages with DIFFERENT languages?

Templating / Display in plugin:
- Support for FE display of results in extra levels (beyond levels 1 and 2, which are hardcoded)?
- Configurable language parameter (hardcoded to "L" now)

Indexing configurations:
- Table selector as part of the section selector in the frontend
Config in backend through flexforms:
- baseUrl for external files?
- language setting for files and external URLs?

CLI feature ideas:
- Removal of old indexes
  - delete results with large tstamp (that's all....)
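
Sketch for the "Removal of old indexes" idea, assuming the item means index rows whose tstamp is older than some cutoff. Python with SQLite is used for illustration only. The function name delete_old_index_rows and the two-column index_phash table are assumptions of this sketch. A real cleanup would presumably also have to remove the matching rows in the related index tables.

```python
import sqlite3
import time


def delete_old_index_rows(conn, max_age_days, now=None):
    """Delete index_phash rows older than max_age_days; return how many were removed."""
    now = int(time.time()) if now is None else now
    cutoff = now - max_age_days * 86400  # 86400 seconds per day
    cur = conn.execute("DELETE FROM index_phash WHERE tstamp < ?", (cutoff,))
    conn.commit()
    return cur.rowcount
```

Run from a CLI script, this could be called once per cron run with a fixed maximum age.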

Backend modules:
- Much nicer detail display
- Proper skinning? / getLL? / XHTML
- The Tools>Indexing module could use some polishing and more useful features (Someone else does this?)

Ideas:
- (Jan Slusarczyk <janslu@grupaiis.pl>, 26/11 2004): Search terms matching exact keywords on pages show a special result/shortcut on top of the result page?
- Implement that extended chars are translated: ü => u, ç => c, etc. Thus "Français" will be found when "Francais" is searched for.
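
Sketch for the extended-chars idea above: one common way to get ü => u and ç => c is Unicode decomposition followed by dropping the combining marks. Python is used for illustration only, and the function name fold_accents is made up here. Letters without a decomposition (for example ø or ß) pass through unchanged and would need an explicit mapping table.

```python
import unicodedata


def fold_accents(text: str) -> str:
    """Strip diacritics: decompose to base letter + combining marks, then drop the marks."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))
```

For the search to match, the same folding would have to be applied to the words when they are indexed and to the search words when they are looked up.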

Hook development:
- Example of search-SQL hook

Documentation:
- Configuration possibilities (piVars, TypoScript, Hooks etc)
- How to set up, analyse and debug indexed search (manual)
- Technical:
  - utf-8 internally.
  - Updates on table structure