Loading...
 
Tiki Importer

Tiki Importer


Re: Re: Re: Mediawiki importer

Brazil

Hi Ben,

I just added support for Mediawiki XML files 0.4 on trunk. For more information I'm copying a message I send to another developer on the Tiki devel list (he was also asking about support for version 0.4).

I suggest you try to import your Mediawiki file running Tiki trunk. Let me know if you need more information on how to get Tiki trunk running.

Cheers, Rodrigo.

Hi Jonny,

I guess I'm using schemaValidate() instead of validate() because the later generates a "no DTD found" error. I haven't investigate to understand why.

See r30477 it adds support for the Mediawiki XML file version 0.4. Apparently there is some validation problems in the DTD of the version 0.4. See the Mediawiki bug I have reported https://bugzilla.wikimedia.org/show_bug.cgi?id=25753

Now you should be able to import the file you were using to test http://en.wikipedia.org/wiki/Special:Export/Train

I have tested and I got some weird results. A big portion of this Wikipedia article is not displayed. I have checked and apparently its content is correctly parsed by TikiImporter_Wiki_Mediawiki::convertMarkup() (see test testConvertMarkupParserWikipediaSamplePage) and correctly added to the tiki_pages table. So apparently the problem is in Tiki parser when we try to display the page. I haven't checked more than this.

This problem might be related with the fact that in this article (and in most Wikipedia articles) the sintax is used a lot. Text_Wiki ignore this sintax. A solution might be to change Text_Wiki Tiki renderer to add ~np~ when rendering .

I'm not planning to put more time on this in the near future but let me know if you need any help.

Cheers, Rodrigo.

There are no comments at this time.