HPAT unable to handle non-existent dataframe index name. #132

triskadecaepyon · 2019-09-03T17:23:09Z

HPAT crashes if it gets a Pandas dataframe with non-existent dataframe index. Many users create ones without such indexes (as is shown in Pandas docs), so it would be useful to fix this bug.

@akharche perhaps the changes to indexing made this behavior happen?

import pandas as pd
import numpy as np
import hpat

df = pd.DataFrame({'0':[100,200,300,400,200,100]})
df2 = pd.DataFrame([100,200,300,400,200,100])
df3 = pd.DataFrame({'A':[100,200,300,400,200,100]})

@hpat.jit
def test_func(data_frame):
    return data_frame

test_func(df) # works with warnings
test_func(df2) # fails and crashes kernel
test_func(df3) # works with warnings

The text was updated successfully, but these errors were encountered:

akharche · 2019-09-05T10:28:14Z

Thank you for the example. I guess this issue is not connected to non-existent dataframe index (I mean the additional column "Index").
Now Hpat is limited in handling DataFrames created such way df2 = pd.DataFrame([100,200,300,400,200,100]), Hpat expects column name like df2 = pd.DataFrame([100,200,300,400,200,100], columns=['A']) or a dictionary like in your examples.
It is mentioned in Hpat docs.

We will add ability to handle such cases.

triskadecaepyon · 2019-09-06T09:09:10Z

So an additional thought then since we are adding this ability:

Is there a way we can elegantly handle unsupported features in the future? As in throw a warning, don't continue, leave as Python Object code?

fschlimb · 2019-09-06T09:18:40Z

It is of course possible to just not compile anything and leave the entire function uncompiled.
It's less straight forward to partially fall back to object mode. It is conceptually possible for certain functions, e.g. those which are trivially data parallel. There is no default fallback for others, like reductions, joins etc. The situation is different if we separated distribution from compilation: going to object-mode in (a hypothetical) non-distributed mode should always be possible.

akharche mentioned this issue Sep 6, 2019

Add tests create dataframe #136

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HPAT unable to handle non-existent dataframe index name. #132

HPAT unable to handle non-existent dataframe index name. #132

triskadecaepyon commented Sep 3, 2019

akharche commented Sep 5, 2019 •

edited

Loading

triskadecaepyon commented Sep 6, 2019

fschlimb commented Sep 6, 2019

HPAT unable to handle non-existent dataframe index name. #132

HPAT unable to handle non-existent dataframe index name. #132

Comments

triskadecaepyon commented Sep 3, 2019

akharche commented Sep 5, 2019 • edited Loading

triskadecaepyon commented Sep 6, 2019

fschlimb commented Sep 6, 2019

akharche commented Sep 5, 2019 •

edited

Loading