What are the most recent and challenging real-world data sets and synthetic data generators?

More Soroosh Shalileh's questions See All

Which research is suitable for starting simulation of plasmonic waveguides?

I'm going to study on waveguides based on plasmonic effect. So, i need to start a way by simulation of an appropriate article. If anyone introduce a manuscript, it's high appreciated.

04 September 2018 2,864 6 View

Is there any handbook containing flow stress curves of different steel grades?

I want to see the work hardening behavior of some die and tool steel grades. There should be a handbook containing this information already.

25 September 2013 4,358 5 View

What are the opposing reasons to polycentric approach in urban-regions?

While polycentric urban regions identified by scholars as a new urban phenomenon and is a spreading pattern worldwide, what are the opposing reasons to this approach? Are there any ideas...

28 May 2013 8,647 1 View

Is it possible to apply regional urbanization to all metropolitan areas across the globe?

Or did all these urban-regions emerge/are they emerging in developed countries that had some more characteristics than the defined dynamics in common which led them to a polycentric region?

07 April 2013 6,916 1 View

Comparing regional urbanization in European context and developing countries, what is the situation on New Towns in regional urbanization?

In developing countries like Iran governments use new towns as a tool to overcome the imbalances in urban-regions. My question is what is the situation of european countries on the new towns,...

25 March 2013 7,321 7 View

Academic Collaboration Invitation

Dear Esteemed Colleagues, I am excited to extend an invitation to collaborate with us on an innovative and cutting-edge project focused on the design of plasmonic-based devices. Plasmonics...

01 January 1970 7,271 1 View

Is there an alternative to a multinomial regression which allows the DV to be non mutually exclusive?

I am trying to analyse data from a survey examining what variables affect teachers perceived barriers to incorporating technology into their classroom. I have 5 predictor variables however my DV...

06 August 2024 1,752 3 View

In order to run Multinomial Logistic Regression, is it required that the data be in the long format?

I am using unit level data (IHDS round 2) & Stata 17

06 August 2024 5,725 2 View

Research Methodology - Impact of Corporate Reputation on Stakeholders Behaviors?

Please can anyone support with the survey questions based on RQ measures and propose how to do it in FMCG industry and include as well the role of brand equity Thanks

06 August 2024 949 0 View

I need the datasets of Microgrid for system identification?

Hi I am working on data driven model of the microgrid, for that, i need the reliable datasets for the identification of MG data driven Model. Thanks

02 August 2024 5,748 4 View

Normality assumption for linear regression is The assumption of normality is whether for residual errors or predictor variavble?

When we conduct linear regression, there are several assumptions. The assumption of normality is whether the residual errors are normally distributed, not whether a predictor is normal?

31 July 2024 6,164 3 View

Which file formats are accepted for supplementary material?

I have a dataset consisting of json files. i tried to upload a zip or tar of it but the system tells me that the file format is not accepted... br

25 July 2024 1,316 3 View

Dataset of synchronized cardiac angiography and ECG?

Hello, I'm working on medical project and I would need synchronized angiography with ECG? Does anyone know if some open source dataset of this kind exist? Regards, Bruno

25 July 2024 2,214 2 View

Is it redundant to use both Random Forest and Decision Tree algorithms in the same regression project?

I am currently working on a regression model for a project and considering using both Random Forest and Decision Tree algorithms. Given that Random Forest is essentially an ensemble of Decision...

23 July 2024 4,306 3 View

Looking for articles that provide guidance on doing Thematic Analysis (Braun & Clarke) as a team?

Does anyone know of any published articles that provide guidance on using TA with multiple team members? I am aware of one (see below), however, there are some disagreements in terms of some of...

23 July 2024 7,819 5 View

How to Select the most suitable machine learning algorithm depending on the characteristics of the given dataset ?

I'm working on a project that involves analyzing a new dataset, and I'm at the stage of selecting the most appropriate machine learning algorithm. The dataset consists of both numerical and...

22 July 2024 6,097 7 View

David Eugene Booth

Google search is still in business I believe.. Best wishes, David Booth

Jochen Wilhelm

You can look for gene expression datasets. There are many, they are large, they can be downloaded as tabular data, and the problems are complex.

For instance:

https://www.ncbi.nlm.nih.gov/geo/

https://hbctraining.github.io/Accessing_public_genomic_data/lessons/accessing_public_experimental_data_odyssey.html

Tomas L Bothe

If you're looking for clinical data check out the MIMIC IV clinical dataset. The dataset has over 200 million entries. It's primarily a data cleaning task before you can use the dataset for any application.