Can CUDA perform a real-time compilation (e.g., using NVRTC) to dynamically create a "__device__" function used by a kernel?

Fritz J Sedlazeck

It's been a while since I last used CUDA, but I doubt it. I think OpenCL, for example, supports something like this.

Panchatcharam Mariappan

I have not tried this so far, but according to this page NVRTC has an option called --device-c (-dc), which the documentation describes as a shortcut for --relocatable-device-code=true.

http://docs.nvidia.com/cuda/nvrtc/index.html#axzz4GwvjLyCt

Ibrahim Al Kharusi

It is not really clear to me, but if the function can be part of an object (class/structure), then yes, you can override it. Additionally, if the only difference is the data type, you could define your function as a template.

Marco Salvatore Nobile

The function is a normal __device__ function. It is not part of a class or structure. The difference is not the data type, so I cannot rely on templates. I actually need to change the calculations performed in the function (i.e., the propensity calculations) according to the model that I am simulating.

I am adding this information in the original post.

Regarding the override part (strictly speaking, this is function overloading):

I ran a basic test and found that overloading is supported. You can use the following to test it yourself:

In your .cuh header file, declare:

__device__ int test_Max_override(int, int);
__device__ int test_Max_override(int, int, int);

In your implementation file, define:

// Two-argument overload: returns the larger of x and y.
__device__ int test_Max_override(int x, int y) {
    if (x > y) return x;
    return y;
}

// Three-argument overload: returns the largest of x, y and z.
// Uses >= so that ties still return a value.
__device__ int test_Max_override(int x, int y, int z) {
    if (x >= y && x >= z) return x;
    if (y >= z) return y;
    return z;
}

Because the function is declared __device__, it must be called from a __global__ kernel or from another __device__ function, i.e., you cannot call it from a host function.

This example overloads on the number of arguments; I assume it will also work with different argument data types if you change the parameter types of one of the declarations above.

Regarding the performance of global variables, you may want to explore __shared__ memory, which is scoped to a single thread block.

I am not sure about the "real-time compilation (e.g., using NVRTC) to dynamically create a "__device__" function" point. If I understand it correctly, I assume that as a workaround you can use the system call system("YOUR EXEC COMMAND") from your code to run the compilation on your CUDA files.
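For what it is worth, NVRTC is a library rather than a command-line tool, so the run-time compilation can also be done in-process without system(). What follows is only a minimal sketch under stated assumptions: the names make_model_source, compile_to_ptx, propensity and eval are hypothetical, the model-specific calculation is passed in as a C++ expression string, and the NVRTC calls are compiled only when nvrtc.h is found (link with -lnvrtc in that case).

```cpp
// Sketch only: generate a model-specific __device__ function and compile it with NVRTC.
#include <stdexcept>
#include <string>

// Use NVRTC only when its header is available, so the file still builds without the toolkit.
#if defined(__has_include)
#  if __has_include(<nvrtc.h>)
#    include <nvrtc.h>
#    define HAVE_NVRTC 1
#  endif
#endif
#ifndef HAVE_NVRTC
#  define HAVE_NVRTC 0
#endif

// Hypothetical helper: wraps a model-specific expression (which may use k and x)
// in a __device__ function, followed by a fixed kernel that calls it.
// extern "C" keeps the kernel name unmangled so it can be looked up as "eval".
std::string make_model_source(const std::string& expr) {
    return
        "__device__ double propensity(double k, double x) {\n"
        "    return " + expr + ";\n"
        "}\n"
        "extern \"C\" __global__ void eval(const double* x, double* out, double k, int n) {\n"
        "    int i = blockIdx.x * blockDim.x + threadIdx.x;\n"
        "    if (i < n) out[i] = propensity(k, x[i]);\n"
        "}\n";
}

// Compiles CUDA source text to PTX. Throws std::runtime_error carrying the
// compiler log if compilation fails, or if built without the NVRTC headers.
std::string compile_to_ptx(const std::string& src) {
#if HAVE_NVRTC
    nvrtcProgram prog;
    if (nvrtcCreateProgram(&prog, src.c_str(), "model.cu", 0, nullptr, nullptr) != NVRTC_SUCCESS)
        throw std::runtime_error("nvrtcCreateProgram failed");

    // No options are passed here; options such as --gpu-architecture=compute_70
    // could be supplied as an array of C strings instead.
    nvrtcResult rc = nvrtcCompileProgram(prog, 0, nullptr);
    if (rc != NVRTC_SUCCESS) {
        size_t log_size = 0;
        nvrtcGetProgramLogSize(prog, &log_size);
        std::string log(log_size, '\0');
        nvrtcGetProgramLog(prog, &log[0]);
        nvrtcDestroyProgram(&prog);
        throw std::runtime_error("NVRTC compilation failed: " + log);
    }

    size_t ptx_size = 0;
    nvrtcGetPTXSize(prog, &ptx_size);
    std::string ptx(ptx_size, '\0');
    nvrtcGetPTX(prog, &ptx[0]);
    nvrtcDestroyProgram(&prog);
    return ptx;
#else
    (void)src;
    throw std::runtime_error("built without NVRTC headers");
#endif
}
```

A call such as compile_to_ptx(make_model_source("k * x * (x - 1.0)")) would then yield PTX that can be loaded through the driver API (cuModuleLoadData, cuModuleGetFunction, cuLaunchKernel). Because the expression is pasted into the source verbatim, it should only come from a trusted place.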

I hope this answers your question.