Hello,

My research is in structural dynamics, and I am dealing with large symmetric sparse matrix computations. Recently I have had to compute the inverse of a stiffness matrix (160146 by 160146) with 4813762 non-zero elements. I did compute the inverse of a smaller 15000 by 15000 stiffness matrix, and it came out almost or fully dense. Initially I tried almost all the scipy.sparse.linalg functions to compute the inverse by solving Ax = b. Currently I am using SuperLU to compute the L and U factors and then computing the inverse column by column with solve(). Since the inverse is dense and I could not store it in RAM, I opted for PyTables.

Unfortunately, writing one column of the inverse matrix takes about 16 minutes (the time for each step is shown after the code below), and the stiffness matrix has 160146 columns in total. I would like to know how I can boost the writing speed so that this inversion finishes in a couple of days. The code is as follows:

import os
import datetime

import numpy
import scipy.sparse.linalg
import tables

LU = scipy.sparse.linalg.splu(interior_stiff)

# ---PyTables creation code for interior_stiff_inverse begins---

# open (or create) the HDF5 file
if not os.path.isfile("HDF5_Interior.h5"):
    f = tables.open_file("HDF5_Interior.h5", 'w')
else:
    f = tables.open_file("HDF5_Interior.h5", 'a')

# compression level and compression library
filters = tables.Filters(complevel=0, complib='blosc')

# f.root -> the default group in the HDF5 file;
# "DS_interior_stiff_inverse" -> name of the dataset;
# tables.Float32Atom() -> the atomic data type
if "/DS_interior_stiff_inverse" not in f:
    print("DS_interior_stiff_inverse DOESN'T EXIST -- creating it")
    out = f.create_carray(f.root, "DS_interior_stiff_inverse", tables.Float32Atom(),
                          shape=(interior_dof_row_ptr, interior_dof_row_ptr),
                          filters=filters)
    # out = f.create_earray(f.root, "DS_interior_stiff_inverse", tables.Float32Atom(),
    #                       shape=(interior_dof_row_ptr, 0), filters=filters,
    #                       expectedrows=interior_dof_row_ptr)
else:
    print("DS_interior_stiff_inverse EXISTS -- reusing it")
    out = f.get_node("/", "DS_interior_stiff_inverse")

# interior_stiff_inverse = numpy.zeros((interior_dof_row_ptr, interior_dof_row_ptr))

for i in range(interior_dof_row_ptr):
    I = numpy.zeros((interior_dof_row_ptr, 1))
    I[i, 0] = 1
    # -- Commented by Libni - interior_stiff_inverse[:, i] = LU.solve(I[:, 0])
    #    so only the inverse column needs to be stored in PyTables.
    print("starting solve() calculation for inverse: ", datetime.datetime.now())
    tmpResult = LU.solve(I[:, 0])
    print("solve() calculation for inverse DONE: ", datetime.datetime.now())
    out[:, i] = tmpResult
    print("Written to hdf5 (pytables): ", datetime.datetime.now())
    # out.append(LU.solve(I[:, 0]))
    print(str(i) + "th iteration of " + str(interior_dof_row_ptr) + " Interior Inv done")
    f.flush()
    print("After FLUSH line: ", datetime.datetime.now())

f.close()

# ---PyTables creation code for interior_stiff_inverse ends---
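(For reference, one variant I considered: if the per-column write turns out to be the bottleneck, it may help to solve for a block of identity columns at once, since SuperLU's solve() accepts a 2-D right-hand side, and then write the whole block of inverse columns with a single slice assignment. This is only a sketch under assumptions; the matrix, sizes, and file name below are small stand-ins, not my real 160146-square stiffness matrix.)

```python
import numpy
import scipy.sparse
import scipy.sparse.linalg
import tables

n = 1000          # stands in for interior_dof_row_ptr
block = 64        # number of columns solved and written per iteration

# Small test matrix standing in for interior_stiff: A = 2*I, so A^-1 = 0.5*I.
A = scipy.sparse.eye(n, format="csc") * 2.0
LU = scipy.sparse.linalg.splu(A)

f = tables.open_file("inverse_block_demo.h5", "w")
out = f.create_carray(f.root, "inv", tables.Float32Atom(), shape=(n, n))

for j0 in range(0, n, block):
    j1 = min(j0 + block, n)
    # A block of identity columns as one 2-D right-hand side.
    B = numpy.zeros((n, j1 - j0))
    B[j0:j1, :] = numpy.eye(j1 - j0)
    X = LU.solve(B)        # one factorized solve for the whole block
    out[:, j0:j1] = X      # one contiguous HDF5 write per block

f.close()
```

This amortizes both the solve overhead and the HDF5 write overhead over `block` columns instead of paying it once per column.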


The time taken for the solve() calculation and for writing to HDF5 is as follows:

stating solve() calculation for inverse: 2017-08-26 01:04:20.424447

solve() calculation for inverse DONE: 2017-08-26 01:04:20.596045

Written to hdf5 (pytable) :2017-08-26 01:20:57.228322

After FLUSH line: 01:20:57.555922

which clearly indicates that writing one column of the inverse matrix to HDF5 takes about 16 minutes. At this rate, computing the entire inverse would take me 1779 days. I am sure the writing time can be improved, but I don't know how. Please help me boost the writing speed to HDF5 so that the matrix inversion can finish within a couple of days.

I have used zero compression in the HDF5 creation, thinking that this would help with fast reading and writing. My computer specs are an i7 with 4 cores and 16 GB of RAM.
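(One thing I wondered about, though I have not verified it on my data: the chunk shape of the CArray. If the chunks are laid out along rows, a single column write would touch a chunk in every row block of the array, which could explain the slow writes. A column-friendly chunkshape can be requested at creation time; the sizes and file name below are made up for illustration.)

```python
import numpy
import tables

n = 512
f = tables.open_file("chunkshape_demo.h5", "w")

# Tall, narrow chunks: each column write touches a single chunk instead of
# one chunk per row block of the array.
out = f.create_carray(f.root, "inv", tables.Float32Atom(),
                      shape=(n, n), chunkshape=(n, 1))

col = numpy.arange(n, dtype=numpy.float32)
out[:, 3] = col   # writes exactly one chunk

f.close()
```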

Any help will be appreciated.

Thank You, Paul Thomas
