Find data that a distribution whose variable domain has one of its parameters as its minimum is a herculean task?

More Okechukwu J. Obulezi's questions See All

If you have access, can you kindly help me?

Dear Colleagues, If you have access to the following book, kindly send it to me. It is urgently needed. "Progressive Censoring: Theory, Methods, and Applications (Statistics for Industry and...

23 December 2023 8,237 0 View

Can anyone help me with this article "Writing a scientific article: A step-by-step guide for beginners"?

Dear colleagues! Kindly assist me with the article titled "Writing a scientific article: A step-by-step guide for beginners" by F. Ecarnot, M.-F. Seronde, R. Chopad, F. Schiele, N....

09 December 2023 2,466 5 View

Urgent assistance from colleagues?

Can someone please assist me with the following book? I do not have funds to access it. "Extreme value distributions: theory and applications" Authors Samuel Kotz, Saralees Nadarajah

11 October 2023 5,622 1 View

What is the similarity or differences in the use Monte Carlo or GLM?

Do Monte Carlo or Generalize Linear Model do the same thing? What are their difference? Which is best for a count data and why?

26 June 2023 3,683 2 View

Survival Analyst, can you help with explanation?

The first plot is supposed to be the proper plot of the survival function of my distribution with t=0 but function seem to extend down the negative part hence I had extend the range of values and...

24 June 2023 7,367 0 View

What tools can i use to analyze metabolic pathway of a bacteria to check if it can be used to express a foreign metabolite of interest?

I am trying to metabolic engineer Salmonella typhimurium to synthesize a molecule by interest, How can i analyse the pathways of Salmonella typhimurium for the efficient production of this...

02 April 2023 5,812 1 View

How can I measure the amount of CO2 produced during yeast culture?

I am monitoring the effect of algae on yeast growth, one of my parameters is the CO2 production rate. I would like to know the suitable method that will allow me to measure the CO2 while it is...

01 January 2021 5,904 11 View

What volume of liquid yeast would be ideal to inoculate into a liquid Sabouraud media?

I am working on enhancing brewers yeast viability, and I am using Liquid Sabouraud media and liquid Saccharomyces cerevisiae. Initially, I made a single serial dilution and inoculated 1ml of the...

24 October 2020 761 7 View

Why do I detect very faint my protein of interest in co-IP but could not detect the protein individually in IP?

I tagged two proteins (A and B) of interest that i seek to study their interaction with Myc and HA respectively. From input, myc tagged protein was robustly present in my samples. Using myc...

31 August 2020 1,066 2 View

What is the reliability of suicide prediction using Machine learning?

Due to the advancement in machine learning and it's application in psychiatry and clinical psychology, their is a need to understand the reliability of various programming software for predicting...

23 November 2019 2,788 7 View

I need the datasets of Microgrid for system identification?

Hi I am working on data driven model of the microgrid, for that, i need the reliable datasets for the identification of MG data driven Model. Thanks

02 August 2024 5,748 4 View

Which file formats are accepted for supplementary material?

I have a dataset consisting of json files. i tried to upload a zip or tar of it but the system tells me that the file format is not accepted... br

25 July 2024 1,316 3 View

Dataset of synchronized cardiac angiography and ECG?

Hello, I'm working on medical project and I would need synchronized angiography with ECG? Does anyone know if some open source dataset of this kind exist? Regards, Bruno

25 July 2024 2,214 2 View

How to Select the most suitable machine learning algorithm depending on the characteristics of the given dataset ?

I'm working on a project that involves analyzing a new dataset, and I'm at the stage of selecting the most appropriate machine learning algorithm. The dataset consists of both numerical and...

22 July 2024 6,097 7 View

How to use evolutionary algorithms with real parameters in ryu sdn controller with large scale?

Hi, I wanna to implement evolutionary algorithms in ryu sdn controller in mininet, i have some challenges, how i can run the big scale topo with one sdn contoller??? and another question is to...

21 July 2024 246 2 View

How to use NCBI datasets ?

I have been trying to extract genome from NCBI using their dataset tool, however some examples seem not to work : ./datasets download genome taxon "Homo Sapiens" --annotated --assembly-level...

20 July 2024 1,339 2 View

How do I access .vcf files without an R statistical package?

I am currently working on a mendelian randomization study, and I have downloaded the datasets needed from the ieu opengwas project (mrcieu.ac.uk) in .vcf format. I do not have access to an R...

19 July 2024 2,342 5 View

Which is the best approach for anomaly detection in scanned image data set?

Anomaly detection in scanned image data set

18 July 2024 3,578 3 View

"Hello, I am trying to find public datasets containing FTIR spectra of blood samples (both healthy and disease-related)?

These datasets will be used in the training of machine learning algorithms. Does anyone know any available data?"

17 July 2024 6,519 3 View

Analysis of MHC-I and II alleles with CNVs and unassigned loci?

I am working on a dataset of MHC-I and II alleles from a bird species sequenced with Illumina. We were not able to assign alleles to loci through MHC-typer as we were over the limit of 150 alleles...

15 July 2024 182 1 View

Najla Matti Isaacc

Finding datasets that specifically fit a Pareto distribution or a shifted distribution like the Shifted Lindley can be challenging, as it requires data that aligns with the specific characteristics and parameters of these distributions. However, I can suggest some potential sources and strategies to explore:

Public Datasets: Look for publicly available datasets in various domains such as economics, finance, social sciences, or engineering that might exhibit heavy-tailed behavior. Websites like Kaggle, UCI Machine Learning Repository, or data.gov can be good starting points for finding diverse datasets.

Economic and Financial Data: Economic and financial data often exhibit heavy-tailed distributions, and certain phenomena in these domains might be suitable for modeling with Pareto or shifted distributions. Examples include income distribution, wealth distribution, stock returns, or insurance claim amounts.

Health and Biological Data: Some health-related data, such as disease prevalence, hospitalization costs, or genetic variations, might exhibit heavy-tailed behavior and can potentially be modeled with Pareto or shifted distributions.

Simulation and Synthetic Data: If you can't find real-world datasets that match the specific distribution you're looking for, consider generating synthetic data using simulation techniques. You can simulate data based on the desired distribution parameters and characteristics to create a custom dataset for experimentation or testing.

Data Transformation: In some cases, you might find datasets that do not directly fit a Pareto or shifted distribution but can be transformed to approximate such distributions. You can apply appropriate transformations (e.g., power transformations) to the existing data to achieve a better fit with the desired distribution.

Data Generation: If you have domain expertise or theoretical knowledge about a specific phenomenon that follows a Pareto or shifted distribution, you can generate synthetic data based on mathematical models or assumptions. This approach can be useful when real-world data is limited or unavailable.

Remember that the availability of datasets fitting specific distributions may vary, and it might require some effort to find datasets that precisely match the parameters and characteristics of the distributions you are interested in. Be prepared to explore multiple sources, potentially preprocess data, or resort to simulations or synthetic data generation techniques to create or approximate the desired distribution.

Okechukwu J. Obulezi

@Najla Matti Isaacc many thanks for your suggestions. I will navigate my way using your answer and give you feedback on the progress made

Md. Akiful Islam Fahim

Finding data for a distribution where one of its parameters serves as the minimum can indeed be a formidable undertaking. The quest for such datasets could be likened to a Herculean task, demanding immense effort and perseverance. However, I shall endeavor to aid you by suggesting datasets that might align with your interest in Pareto distributions and shifted distributions like the Shifted Lindley. While the availability of specific datasets tailored to these exact requirements may be limited, a potential approach could involve exploring various domains, such as economics, finance, or social sciences, where Pareto-type distributions frequently emerge. By delving into these fields, scrutinizing datasets encompassing income distribution, wealth accumulation, or even power-law phenomena, you might discover instances where the characteristics of the desired distributions align, albeit with potential adjustments or transformations. It may necessitate careful data exploration, preprocessing, and model fitting to achieve a satisfactory fit to your desired distribution, but with determination, creative thinking, and the utilization of available resources, you can potentially unearth datasets that offer valuable insights into the domain of interest.

Many thanks Md. Akiful Islam Fahim