Hosting Mongo server for data for this week challenge #3

Open
ianks opened this issue Nov 14, 2014 · 21 comments

Comments

@ianks

ianks commented Nov 14, 2014

Due to the size of the data in this challenge, I figured some people would have some issues getting it indexed/hosted in Mongo in a reasonable time. I went ahead and started a Mongo server with all of the data hosted.

You can access it like so:
$ mongo --host 198.199.113.194

@ianks
Author

ianks commented Nov 14, 2014

cc: @CSCI-4830-002-2014/students

@BrianNewsom

Thanks Ian! This is great. Also I just used

$ mongo 198.199.113.194

to access it; the -h flag brought up the help text on my Windows machine.

@ianks
Author

ianks commented Nov 14, 2014

Good call! Updating post now.

@indiesquidge

It was working moments ago just as mongo 198.199.113.194, but now it won't work at all. I get the error:

MongoDB shell version: 2.6.5
connecting to: 198.199.113.194/test
2014-11-14T13:51:32.511-0700 warning: Failed to connect to 198.199.113.194:27017, reason: errno:61 Connection refused
2014-11-14T13:51:32.511-0700 Error: couldn't connect to server 198.199.113.194:27017 (198.199.113.194), connection attempt failed at src/mongo/shell/mongo.js:148
exception: connect failed

I think the update must have changed something. Are you sure the IP stayed the same?

@ianks
Author

ianks commented Nov 14, 2014

I just upgraded mongo. For some reason the Debian package was at 2.0; now we're at 2.6. Should be good now. Can you confirm it works?

@indiesquidge

👍 Thanks!

@antsankov

Awesome stuff!

@JoshFerge

you da best

@alne4294

Thank you!

@mynameisfiber
Member

👍

@dawsbot

dawsbot commented Nov 16, 2014

Can't...

> db.reddit.find({},{}).sort({ups:1})
error: {
        "$err" : "Runner error: Overflow sort stage buffered data usage of 33555127 bytes exceeds internal limit of 33554432 bytes",
        "code" : 17144
}

@ianks
Author

ianks commented Nov 16, 2014

@dawsonbotsford Try throwing a .limit(10) on there.
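
For context on the error above: the internal limit it quotes works out to exactly 32 MiB, and the unbounded sort went over it by only a few hundred bytes. Here is a rough sketch of that arithmetic plus the same query with a limit added. It assumes the first host posted in this thread and the default `test` database, and it only prints the `mongo --eval` command instead of running it, so nothing touches the server until you paste it yourself.

```shell
#!/bin/sh
# The sort-stage limit from the error message, expressed as 32 MiB.
LIMIT_BYTES=$((32 * 1024 * 1024))
# How far over the limit the unbounded sort went.
OVER_BY=$((33555127 - LIMIT_BYTES))
echo "limit=$LIMIT_BYTES over_by=$OVER_BY"

# Same query as above with .limit(10) appended, as suggested.
# Assumption: host is the one posted earlier in this thread.
HOST="198.199.113.194"
QUERY='db.reddit.find({}, {}).sort({ups: 1}).limit(10).forEach(printjson)'

# Build the command as a string and print it rather than executing it.
CMD="mongo --host $HOST --eval '$QUERY'"
echo "$CMD"
```

With a limit in place the server should only need to hold the top few documents while sorting, which is why this tends to stay under the buffer limit.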

@mynameisfiber
Member

You could always limit it to comments from one particular day in the dataset. This'll add potential biases that you can think about and find ways to get around (maybe redo the calculation for different days and check that the conclusions are approximately the same)

@dawsbot

dawsbot commented Nov 16, 2014

Where do I find the weather data on this server?

@ianks
Author

ianks commented Nov 16, 2014

I don't believe anyone had uploaded that yet. Feel free to do so in a new collection, though.

@dawsbot

dawsbot commented Nov 17, 2014

Peyman added it in JSON format in collection weather

@dawsbot

dawsbot commented Nov 17, 2014

How are we supposed to convert the csv to json? Peyman and I are dead in the water

@BrianNewsom

Import it as CSV with mongoimport, using --type csv and --headerline

@dawsbot

dawsbot commented Nov 17, 2014

Do we need to build our own headerline?

@BrianNewsom

If you look at the file, the first line is a header...

http://docs.mongodb.org/manual/reference/program/mongoimport/

is what I referenced.
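
A minimal sketch of what that looks like, in case it helps. The CSV here is a tiny stand-in with made-up columns, and the collection name `weather_csv` and the `test` database are assumptions, not something anyone set up on the server. The import command is only printed, not executed, so you can check it before running it against the shared host.

```shell
#!/bin/sh
# Stand-in CSV; the column names and the row are invented for illustration.
CSV_FILE="$(mktemp)"
cat > "$CSV_FILE" <<'EOF'
date,station,temp_f
2014-11-01,EXAMPLE,41
EOF

# The first line already names the columns, so --headerline can use it as-is.
HEADER="$(head -n 1 "$CSV_FILE")"
echo "header: $HEADER"

# Assumptions: host from this thread, default "test" database,
# hypothetical collection name "weather_csv".
HOST="198.199.113.194"
CMD="mongoimport --host $HOST --db test --collection weather_csv --type csv --headerline --file $CSV_FILE"

# Print the command instead of running it.
echo "$CMD"
```

If the first line of your real file is not a header, `--fields` with a comma-separated list of names is the alternative to `--headerline`.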

@ianks
Author

ianks commented Nov 17, 2014

So I ended up booting an instance with 32 CPUs and 32 GB of memory to deal with these queries...

$ mongo --host 104.236.191.166
