Masab Ahmad

23 Questions 20 Answers 0 Followers

Questions related from Masab Ahmad

What percentage of program completion time is spent in synchronization on GPUs?

Assume a memory access bound workload such as graph analytics, machine learning, monte-carlo simulations etc. Assume a high-end single chip GPU of the current generation (2020).

02 February 2020 827 3 View

How do recent works against Spectre and other architectural attacks stack up in performance and usability?

Works such as Mi6, InvisiSpec, and IRONHIDE, and many other works, target secure isolation of target processes. How do these works rank and work in terms of performance, complexity, and usability?

05 May 2019 8,615 2 View

Has machine learning improved current weather modeling predictions?

Weather modeling is a prediction problem, and most models are not entirely accurate. Has machine learning improved current weather models? What has improved and how does it improve predictions?

04 April 2019 982 2 View

What are the implications of computing on today's environment?

Computing takes significant power, resources, and man-power. What are the environmental implications of computing in this context?

04 April 2019 2,888 3 View

What type(s) of machine learning or AI algorithms are considered most dangerous for automation of jobs?

Automation of various jobs is underway. What types of machine learning algorithms are used to replace human work?

04 April 2019 5,732 5 View

How does Bayesian inference compare against other machine learning models?

Bayesian inference is a machine learning model not as widely used as deep learning or regression models. Why is it not as widely used and how does it compare to highly used models?

04 April 2019 1,113 9 View

What are your views on the latest advancements in quantum computing for machine learning algorithms?

This question stems from the latest papers in quantum machine learning. https://www.technologyreview.com/s/613119/quantum-computing-should-supercharge-this-machine-learning-technique/

03 March 2019 5,929 3 View

How will applications be context switched in a quantum computer?

Quantum computers are known to run at least a couple of applications better than conventional computers. When a quantum computer does come about to exist, how will applications be switched in and...

03 March 2019 3,731 0 View

Is Graph-based Machine Learning an upcoming research field of interest?

As machine learning evolves into other domains, it is encompassing domains that use graph structures. What are the algorithms and open-source implementations in such cases?

03 March 2019 2,991 6 View

Are graph algorithms a good fit for future quantum computers? What are the caveats for such algorithms running on such machines?

Quantum computers are known to perform extremely well on a limited number of problems at this time. For real applications, such as path planning and search in graph analytics, are quantum...

03 March 2019 8,568 2 View

Which is the best parallel implementation for Dijkstra's algorithm in graph analytics?

Dijkstra's algorithms performs well sequentially. However, applications require even better parallel performance because of real-time constraints. Implementations such as SprayList and Relaxed...

03 March 2019 4,979 5 View

Are quantum computers going to be used as general purpose machines? or as accelerators connected to the CPUs of today?

Quantum computers are known to perform well on a limited problem space. With fast incoming developments in their technology, are quantum computers going to be used as general purpose machines in...

03 March 2019 8,486 3 View

Why are Nvidia's GPUs more popular than AMD's?

Nvidia has a larger market share of GPU sales. What are the reasons for this larger share?

03 March 2019 10,091 1 View

What is the purpose of performance predictors? Which predictors are the best available?

Performance prediction is required to optimally deploy workloads and inputs to a particular machine/accelerator in computing systems. Different predictors (e.g. AI predictors) come with different...

03 March 2019 10,067 3 View

Why is Intel SGX platform broken in terms of security aspects?

Intel's SGX extensions create isolated application enclaves, which disallow information leakage and unverified access to private data. However, SGX is now known to be broken as some works have...

03 March 2019 4,089 4 View

How do we accelerate synchronization in future multicores having hundreds of cores? What is the right architectural primitive to use in this case?

Synchronization overheads blow up exponentially as more and more cores are deployed on a tiled mesh multicore. Synchronization costs increase as a multicore can only have a limited number of...

02 February 2019 8,963 2 View

What is best parallel algorithm available for breadth-first search that provides best raw performance in terms of complexity and synchronization?

Current parallel BFS algorithms are known to have reduced time complexity. However, such cases do not take into account synchronization costs which increase exponentially with the core count. Such...

02 February 2019 3,217 3 View

What is the right algorithm to stream graphs if they are larger than the machine's main memory?

Graph algorithms such as BFS and SSSP (Bellman-Ford or Dijkstra's algorithm) generally exhibit a lack of locality. A vertex at the start of the graph may want to update an edge that exists in a...

02 February 2019 3,546 2 View

Which accelerator(s) do you think are going to be used in future architectures? What decides which accelerator to deploy in an on-chip setup?

Current architectures already have accelerators integrated within them (e.g. GPGPUs). However future multicores are expected to have more and more cores, and hence more tiles as accelerators....

02 February 2019 6,420 2 View

For any given parallel algorithm or implementation, which parallel machine (e.g. GPUs, Xeon or Xeon Phi) is considered optimal for performance?

Some workloads or even inputs perform well on GPUs, while others perform well on multicores. How do we decide which machine to buy for a generic problem base for optimal performance? Cost is NOT...

02 February 2019 659 4 View

Which is the best parallel implementation to find single source shortest paths for graph analytics?

Dijkstra's algorithm performs the best sequentially on a single CPU core. Bellman-Ford implementations and variants running on the GPU outperform this sequential Dijkstra case, as well as parallel...

02 February 2019 8,722 1 View

Algorithm complexity calculations assume synchronization costs and memory accesses do be done in constant O(1) time. Are these good assumptions?

Synchronization and memory costs are becoming humongous bottlenecks in today's architectures. However, algorithm complexities assume these operations as constant, which are done in O(1) time. What...

02 February 2019 6,310 5 View

Is using Python vs. C/C++ worth it in terms of performance?

C/C++ show better performance than Python due to Python's higher level function calls and wrapping routines. However, Python's time-to-program is lower than C/C++ due to lower language complexity....

01 January 2019 1,415 9 View