pythonmat-file

How can I convert complex .mat file to csv using python


I am trying to convert a .mat file to csv using python. The code I am using is

import scipy.io
import numpy as np

data = scipy.io.loadmat("wiki.mat")

for i in data:
    if '__' not in i and 'readme' not in i:
        np.savetxt(("file.csv"),data[i],delimiter=',')

When ever I run this code, I get error as follows:

Traceback (most recent call last):
  File "test.py", line 8, in <module>
    np.savetxt(("file.csv"),data[i],delimiter=',')
  File "/Library/Python/2.7/site-packages/numpy/lib/npyio.py", line 1258, in savetxt
    % (str(X.dtype), format))
TypeError: Mismatch between array dtype ('[('dob', 'O'), ('photo_taken', 'O'), ('full_path', 'O'), ('gender', 'O'), ('name', 'O'), ('face_location', 'O'), ('face_score', 'O'), ('second_face_score', 'O')]') and format specifier ('%.18e')

I am trying to convert the .mat file from this link: https://data.vision.ee.ethz.ch/cvl/rrothe/imdb-wiki/static/imdb_meta.tar

Please help me out with some working solution!


Solution

  • https://docs.scipy.org/doc/numpy-1.13.0/reference/generated/numpy.savetxt.html

    Save an array to a text file.

    You can, unfortunately, only store a single numeric numpy array in a single file. Whereas your .mat file contains a structure:

    >> fieldnames(imdb)
                            ans = 
                            {
                              [1,1] = dob
                              [2,1] = photo_taken
                              [3,1] = full_path
                              [4,1] = gender
                              [5,1] = name
                              [6,1] = face_location
                              [7,1] = face_score
                              [8,1] = second_face_score                          
                              [9,1] = celeb_names
                              [10,1] = celeb_id
                            }
    >> imdb.name(1)        
                            ans = 
                            {
                              [1,1] = Fred Astaire
                            }
    

    It might make sense to convert the data to a numpy dictionary (as described in "Complex matlab-like data structure in python (numpy/scipy)"), and store that as a .csv using How do I convert this list of dictionaries to a csv file? [Python]