ArrayIndexOutOfBoundsException in make sql-dump-parts #18
Comments
Looks like the xml file is malformed or misread. Could you post the output of
and also the contents of EDIT: also the outputs of
and
Might be useful. |
Thanks for following up. I've included the output you requested. The output is pretty noisy; let me know if you want me to be more specific with grep.
|
Just so this is documented and you are not completely in the dark: due to a bug(?) in mwdumper, when it is fed the XML and expected to produce SQL, Xerces (the XML parser) sometimes throws the exception that you saw, namely
I found that removing the offending The way I went about implementing this is: the downloaded XML gets parsed into another XML file by mwdumper. If this process fails, we look backwards into the output XML file for a The problem is almost definitely with my code; I will take a look at it shortly. |
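For anyone following along, here is a rough sketch of the "look backwards into the output file" step described above. It is not the project's actual code. The thread does not say which marker is searched for, so `marker` is a parameter, and the `</page>` value in the usage comment is only an assumed placeholder. The function names `find_last_marker` and `truncate_file` are made up for illustration.

```python
import os


def find_last_marker(path, marker, chunk_size=1 << 16):
    """Return the byte offset of the last occurrence of `marker` in the
    file at `path`, or -1 if it does not occur.

    The file is read in chunks from the end, so a large dump does not
    have to be loaded into memory.
    """
    if not marker:
        raise ValueError("marker must be non-empty")
    overlap = len(marker) - 1
    with open(path, "rb") as f:
        f.seek(0, os.SEEK_END)
        end = f.tell()
        tail = b""  # first bytes of the data that follows the current chunk
        while end > 0:
            start = max(0, end - chunk_size)
            f.seek(start)
            chunk = f.read(end - start)
            # Append the tail so a marker straddling a chunk boundary is seen.
            window = chunk + tail
            idx = window.rfind(marker)
            if idx != -1:
                return start + idx
            tail = window[:overlap]
            end = start
    return -1


def truncate_file(path, size):
    """Cut the file at `path` down to `size` bytes."""
    with open(path, "r+b") as f:
        f.truncate(size)


# Hypothetical usage (file name and marker are assumptions, not from the thread):
#   marker = b"</page>"
#   pos = find_last_marker("part27.out.xml", marker)
#   if pos != -1:
#       truncate_file("part27.out.xml", pos + len(marker))
```

Reading from the end keeps the cost proportional to the distance between the failure point and the marker rather than to the size of the whole file, which matters for multi-gigabyte dump parts.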
We have encountered the same error. |
@dldharma just so I don't have to download everything from scratch, do you have it on an infolab machine? |
Unfortunately, it is not on an infolab machine.
|
@dldharma this project is now mostly defunct -- we had a lot of problems setting up a full mirror of Wikipedia. Check out WikipediaBase, a virtual database that combines local data obtained from the Wikipedia dumps with data fetched live from the Wikipedia API. |
Thanks a lot for your prompt reply. Much appreciated. Regards,
|
@michaelsilver the import process completed successfully. It populated the Articles, Classes and Article Classes mappings. Thanks once again for sharing WikipediaBase. On reviewing the data, I found that article categories are present only in Article.markup. Regards, |
@dldharma why don't you make an issue in WikipediaBase and we can discuss further there. When you create the issue, please provide a printout of the tables you have populated ( |
@michaelsilver agree. Created issue 277. Thanks once again for your prompt replies. Appreciate ! |
I'm installing the 2015-06-02 dumps. I got an error in the
make sql-dump-parts
step. Parts 1-26 completed successfully, but the 27th file did not. I'm opening an issue, as instructed below.