History: TikiWikifier
Preview of version: 3
Overview
As u may guess from title ๐ Tiki Sucker is a feature to convert remote files (is there need to convert local files?) from HTML into wiki page.
What Has Been Implemented
- Look at wiki edit form for 'Import HTML' field. Type here URL (with http:// prefix) of page u want to import.
- Set 'Try to convert HTML to wiki' as needed
- If set Tiki Sucker will parse HTML and generate wiki page with wiki syntaxes of corresponding HTML tags
- If blank Tiki Sucker will append pure HTML text to the end of current page
- Recognized wiki syntaxes
[+]
Future Plans
- This is rapid (sometimes ugly) implementation of wiki HTML import. I think it should be plugable import/export converters with ability to choose from edit form what converter to use for operation.
- In case of import converter, it can try to guess what source file is (using mime type from remote server response).
- Of couse converters may have itsown configuration panel what should be pluged into wiki page edit form too... (like HTML importer may have flags 'strip HTML tags' and 'try to convert HTML to wiki' ๐
- At least one export filter for wiki already exists ๐ — PDF exporter...
- Need to code converter HTML specials (like and so on) into wiki
- it could be one
preg_replace()
with arrays of search/replace ๐
- it could be one
- Need to verify/fix HTML parser grammar file! As I saw it lack much of tags/attributes! (so runtime PHP errors may occur on import) — Smbd with HTML docs/specs who can do this? Any volunteer? ๐
- Maybe it is resonable to introduce 'strict' parsing mode? (to reduce noice anount on converted pages)
- i.e. remove all empty lines after parse (convert)
- join lines during parce process
- line breaks on output wiki page may occur only on explicit
<br>
,<p>
or after some directives (like headers and list items) - should it check for wiki syntax errors after generate wiki page? like multilined bold/italic/etc. (if smbd don't know it is one-liners)