If two documents differing by only one word are considered two distinct documents, hashing techniques provide an easy solution.
For instance, hash each word to a sparse 0/1 bit string and OR-accumulate the hashes of all the words in a document to obtain the document's "signature"; then compare a new document's signature against the signatures already stored in the database. This scales linearly with the size of the database, and it can be sped up further by structuring the signature space.
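A minimal Python sketch of this Bloom-filter-style signature; the signature width and number of hash functions per word (`NUM_BITS`, `NUM_HASHES`) are illustrative choices, not prescribed values:

```python
import hashlib

NUM_BITS = 1024   # signature width (illustrative)
NUM_HASHES = 3    # bits set per word, Bloom-filter style (illustrative)

def word_bits(word: str) -> int:
    """Hash one word to a sparse 0/1 bit string, represented as an int bitmask."""
    mask = 0
    for seed in range(NUM_HASHES):
        digest = hashlib.sha1(f"{seed}:{word}".encode()).hexdigest()
        mask |= 1 << (int(digest, 16) % NUM_BITS)
    return mask

def signature(document: str) -> int:
    """OR-accumulate the word hashes to get the document signature."""
    sig = 0
    for word in document.lower().split():
        sig |= word_bits(word)
    return sig

# Linear scan of the signature database for an exact signature match.
database = [signature(doc) for doc in ["the quick brown fox", "hello world"]]
query = signature("the quick brown fox")
print(any(query == sig for sig in database))  # True: same word set, same signature
```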
Now, if "unique" means "thematically unique", the above will obviously fail!
With this shallow requirement you could just use an MD5 checksum to figure out whether a document is unique.
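For example, using Python's standard `hashlib` (the sample documents are purely illustrative):

```python
import hashlib

def md5_of(document: str) -> str:
    """MD5 checksum of the raw bytes; identical bytes give an identical checksum."""
    return hashlib.md5(document.encode("utf-8")).hexdigest()

seen = set()
for doc in ["some text", "other text", "some text"]:
    checksum = md5_of(doc)
    if checksum in seen:
        print("exact duplicate:", repr(doc))
    seen.add(checksum)
```

Note this only catches byte-for-byte duplicates; a single changed character yields a completely different checksum.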
More interesting, of course, is the question of how to identify duplicates within large collections of documents. For that I would use minHashing and Locality-Sensitive Hashing, described in "Mining Massive Datasets".
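A bare-bones minHash sketch in Python; the shingle size and number of hash functions are arbitrary illustrative choices, and a real system would add the LSH banding step from the book to avoid all-pairs comparison:

```python
import hashlib

NUM_HASHES = 64  # signature length (illustrative)

def shingles(document: str, k: int = 3) -> set:
    """The set of k-word shingles of a document."""
    words = document.lower().split()
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}

def minhash(document: str) -> list:
    """MinHash signature: per hash function, the minimum hash over all shingles."""
    doc_shingles = shingles(document)
    return [
        min(int(hashlib.sha1(f"{seed}:{s}".encode()).hexdigest(), 16)
            for s in doc_shingles)
        for seed in range(NUM_HASHES)
    ]

def estimated_jaccard(sig_a: list, sig_b: list) -> float:
    """Fraction of agreeing positions estimates the Jaccard similarity."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)

a = minhash("the quick brown fox jumps over the lazy dog")
b = minhash("the quick brown fox jumps over a lazy dog")
print(estimated_jaccard(a, b))  # high value -> likely near-duplicates
```

The point of the signature is that two documents agree in a given position with probability equal to their Jaccard similarity, so a short fixed-length signature stands in for the full shingle sets.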