What amount of data is necessary to extract meaningful/significant Sequential patterns / rules in a sequential database?

More Dominik Raab's questions See All

Network Analysis: Is it possible to merge two multivariate networks?

Hello, I am conducting an expert study involving two distinct groups of psychology experts (study participants), each being specialized in a different concept. The first group is specialized in...

30 May 2024 4,899 1 View

Hello Community, can someone please recommend me academic studies that use the exploratory factor analysis using not more than 10 initial factors?

I need to access and read academic studies that use the exploratory factor analysis (EFA) as a analysis method with no more than 10 initial factors. I want to explore if this is academically...

07 March 2024 7,044 2 View

How to dissolve poly(styrene-co-4-vinylbenzyl chloride)?

Polymer (60/40) is made with traditional emulsion polymerization (KPS, SDS). I need to dissolve polymer in dmf or dmso for n-alkylation reaction. I found a publication showing how to make that...

20 January 2024 8,204 4 View

Is it reasonable to calculate the percentage polymer crystallinity using the crystallization-peak instead of melting-peak?

Hey so far i know that usually to calculate the percentage crystallinity of a polymer sample the enthalpy of melting is compared to the one of a theoretical 100% crystalline sample of the same...

03 January 2024 1,074 12 View

Can cyclical diketones work in McMurry coupling?

As in questions, On example can dimedone be used in McMurry coupling? Is there any publications on cyclical diketones polymerization using McMurry Coupling?

24 October 2023 1,166 0 View

What is th best way of making free amina base from its salt?

Hello, I have 4-amino-1-methylpiperidine hydrochloride and I need a pure 4-amino-1-methylpiperidine amine base. I think of reversed base extraction with KOH solution and toluene or DCM (if not...

10 October 2023 115 0 View

Book chapter - how do I claim my authorship?

How do I add a book chapter in a published book, whose editor is someone else to my list of publications? The book is referenced, has a DOI and is online at Research Gate already. Thank you

26 September 2023 3,599 1 View

Expert Study: What data analysis method to use when the sample size is small?

I am conducting an expert study in which I ask experts in a specific subfield of psychology to select resources that are pertinent to the target concept. Specifically, they are tasked with...

21 September 2023 6,242 4 View

How much money do cities in Europe spend on green space management?

Are there any publications that show how much money cities in Europe have spent on green space management in recent years?

16 July 2023 4,962 2 View

OOMMF: How to simulate the stay field of micro magnets ?

Hi, I'd like to simulate the stay fields of micro magnets with different geometries. The goal is to get the shape of the stray field and the value of the strength of the field at a specific...

04 June 2023 966 0 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

Strugglling with m6A dot blot any suugesstion ?

I have been doing the m6A dot blot for a while with no improvement, I am extracting the RNA, and I can see the dots although the three biological replicas give a different reading on the memberan...

10 August 2024 8,539 5 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Hi, I have a question about normalizing the MTT OD values for doing the statistical analysis. So, if we have 3 different plates and we call them 3 different replicates, so, first we would...

07 August 2024 8,106 4 View

Nahian Ahmed

There are several issues that need to be addressed concretely:

What is the 'length' of the rules i.e. how many parameters/factors/features are a rule dependent on?

Do all the rules have the equal 'length'?

Does 1000 sequences mean that there are 1000 samples of sequences?

Are the rules and sequences overlapping or are they disjoint?

What does the 'confidence' actually mean? Is it the confidence in frequent itemset mining? Is it the percentage of of all possible rules? Is it the probability of encompassing all possible rules? or something else?

To answer your question, the amount of data that is necessary is dependent on the questions mentioned above. For example, for rules of higher 'lengths', more data would be necessary.

Hope this helps.

Erik Cuevas

I am not sure, but if there is a lack of data, maybe techniques as bootstart can be adequated.

Uday Kiran Rage

As a sequence represents a collection of transactions, sequence databases have relatively less number of rows. For real world applications, small datasets are fine. What kind of nontrivial knowledge your discovering is important than size of data. I am currently working on real world applications, where data has no more than 100 lines, but the length of each sequence is very huge.

For publishing, my advice is as follows. Choose synthetic very large datasets from SPMF library to demonstrate that our model/algorithm scales well. Simultaneously, employ your real world small database to demonstrate that your model can discover useful information.