Pandas SparseDataFrame from dicts list

Question

Pandas SparseDataFrame from dicts list

I'm trying to convert a Python dicts list to Pandas DataFrame

. Since each dict has different keys, it takes up too much memory. Since most values are NaN, a should be useful in this case SparseDataFrame

.

import pandas

df = pandas.DataFrame(keyword_data).to_sparse(fill_value=.0)

This works, but it takes up a lot of memory because the DataFrame is being created at the same time and sometimes it is MemoryError

.

Is it possible to create a SparseDataFrame with this data without this step? The Pandas documentation is of little help in this case ... Doing this:

pandas.SparseDataFrame(keyword_data, default_fill_value=.0)

Raises:

TypeError: ufunc 'isnan' is not supported for input types and inputs cannot be safely bound to any supported types according to the casting rule `` safe ''

The data looks something like this:

[{'a': 0.672366,
  'b': 0.667276,
  # ...
 },
 {'c': 0.507752,
  'd': 0.532593,
  'e': 0.507793
  # ...
 },
 # ...
]

Keys are always strings, with different dictaphone keys, values are floating point.

Is there a way to create SparseDataFrame

directly from this data without going through a regular one DataFrame

?

+3

python numpy pandas

yprez 29 oct. '14 at 9:41

source to share

No one has answered this question yet

Check out similar questions:

3474

How to list all files in a directory?

3235

How to check if a list is empty?

3119

What is the difference between Python list methods that are appended and expanded?

2849

How to make a flat list from a list of lists?

2818

Finding the index of an element by specifying the list that contains it in Python

1553

Renaming columns in pandas

1419

Select rows from DataFrame based on values in column in pandas

1033

Remove column from panda DataFrame

879

Get list from pandas DataFrame column headers

873

Big data workflows using pandas

Pandas SparseDataFrame from dicts list

More articles: