[Bug]: load_dataset: Some datasets raise a python error #328

Open
khanlou opened this issue Dec 21, 2024 · 0 comments
Labels
bug (Something isn't working), community, pgai

Comments

khanlou commented Dec 21, 2024

What happened?

I was able to get the docker container loaded and running, and while some Hugging Face datasets load fine (like AWeirdDev/airports), I have one (very large) dataset that I'm having trouble getting imported: foursquare/fsq-os-places.

When I try to import it, here's what I see:

```
postgres=# select ai.load_dataset('foursquare/fsq-os-places', table_name => 'hf_places');
ERROR:  AttributeError: 'dict' object has no attribute 'dtype'
CONTEXT:  Traceback (most recent call last):
  PL/Python function "load_dataset", line 28, in <module>
    return ai.load_dataset.load_dataset(
  PL/Python function "load_dataset", line 194, in load_dataset
  PL/Python function "load_dataset", line 39, in get_column_info
  PL/Python function "load_dataset", line 39, in <dictcomp>
PL/Python function "load_dataset"
```

It looks like the code expects an object with a `dtype` attribute but is getting a plain dictionary instead?

I also tried using load_dataset_multi_txn, which results in the same error.
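
For anyone triaging, here is a minimal sketch of how this kind of error could arise. It is a guess based only on the traceback, not the actual pgai code: it assumes the dataset schema contains at least one nested (struct-like) column that is represented as a plain `dict`, while scalar columns are objects with a `dtype` attribute. The `Value` class, the `features` schema, and both helper functions below are hypothetical stand-ins made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Value:
    """Hypothetical stand-in for a scalar feature type exposing ``dtype``."""
    dtype: str


# Hypothetical schema: two scalar columns plus one nested column stored as a dict.
features = {
    "name": Value("string"),
    "latitude": Value("float64"),
    "bbox": {"xmin": Value("float64"), "ymin": Value("float64")},
}


def naive_column_info(features):
    # Assumes every feature has .dtype, so a nested dict raises AttributeError.
    return {name: feature.dtype for name, feature in features.items()}


def tolerant_column_info(features):
    # Labels anything without a dtype as "json" instead of failing.
    return {name: getattr(feature, "dtype", "json") for name, feature in features.items()}


try:
    naive_column_info(features)
except AttributeError as exc:
    print(exc)  # 'dict' object has no attribute 'dtype'

print(tolerant_column_info(features))
```

If that assumption holds, the failure would depend on the dataset's schema rather than its size, which might explain why a flat dataset like AWeirdDev/airports loads fine. Mapping nested columns to a JSON-like type is only one possible way to handle them.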

pgai extension affected

0.6.0

pgai library affected

No response

PostgreSQL version used

17.2

What operating system did you use?

Ubuntu 24.04.1

What installation method did you use?

Docker

What platform did you run on?

Other

Relevant log output and stack trace

No response

How can we reproduce the bug?

Run `select ai.load_dataset('foursquare/fsq-os-places', table_name => 'hf_places');`

Are you going to work on the bugfix?

🆘 No, could someone else please work on the bugfix?

khanlou added the bug, community, and pgai labels on Dec 21, 2024