5 Questions 17 Answers 0 Followers
Questions related from Tony C. Scott
Hello - I need to apply multi-threading to a subroutine in python on spark. Any feedback or references w.r.t. getting a fast and efficient outcome would be greatly appreciated. FYI the subroutine...
05 May 2018 5,406 4 View
We are building a unique ambient data platform at Near.co in India, that is a) truly global in scale b) solving some really core problems with highly disparate data sets and c) creating products...
09 September 2017 5,164 3 View
I have been working with Python's package for sparse matrices (scipy.sparse) which boast a number of sparseness structures: CSR, CSC, LIL, COO, etc... I find that certain operations are faster in...
09 September 2017 6,962 0 View
I downloaded adroll/cantor to experiment getting hash counts of dataset intersections. It uses Hyperloglog (HLL) + minhash but I find the test cases insufficient. I need to know if ALL...
06 June 2016 531 5 View
Examples can be molecular dynamics, chemical reactions, particle physics, astronomy, even weather patterns. It's the data in flat-form, e.g. spreadsheets that I need.Attached is an example for...
11 November 2015 599 3 View