The simple and clear answer is yes. We have applied LSA as a pre-processing step in the automatic processing of patent applications and in patent analysis. Of course, you also need a good classification method for your features. Unfortunately, our publications on this matter are in German.
Very briefly and generally, LSA is a method to reduce the feature space, and in this way semantic groups of documents can be created. Suppose you have a document group D_Informatic that describes computers, databases, etc., and another document group D_animal that describes dogs, cats, etc. For D_Informatic you could have features such as mysql, postgresql, ibm, iphone. For D_animal you could have features such as persian_cat, siamese, bulldog, etc. For this data set, LSA could create, for example, two new features (f1 and f2) that separate the two groups, and in this way you obtain a kind of "semantics": feature f1 encodes D_Informatic and f2 encodes D_animal (it is a kind of generalization process). First, though, you have to define what you mean by "semantic extraction": LSA does not tell you explicitly that, for example, postgresql is a database.
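To make this concrete, here is a minimal sketch (assuming Python with scikit-learn; the toy documents are just illustrative stand-ins for the two groups) of how a rank-2 LSA projects the two groups onto two latent features:

```python
# Minimal sketch (assuming scikit-learn): LSA on a toy two-group corpus.
# The documents below are illustrative stand-ins for D_Informatic / D_animal.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import TruncatedSVD

corpus = [
    "mysql and postgresql are database systems",       # D_Informatic
    "ibm builds computers and database software",      # D_Informatic
    "the persian cat and the siamese are cat breeds",  # D_animal
    "a bulldog is a dog breed like the terrier",       # D_animal
]

# Bag-of-words term-document features, then a rank-2 truncated SVD:
# the two latent features f1 and f2 should separate the two groups.
X = CountVectorizer().fit_transform(corpus)
Z = TruncatedSVD(n_components=2, random_state=0).fit_transform(X)

for doc, (f1, f2) in zip(corpus, Z):
    print(f"f1={f1:+.2f}  f2={f2:+.2f}  {doc}")
```

In a setting like the patent example above, the rows of Z (one per document) would then be the features handed to the classifier.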
@Volker Where can I find your papers? I would be interested in reading them; I can read German without problems.
Generally there is something there in distributional semantics and statistical methods like latent semantic analysis, latent Dirichlet allocation, or the co-occurrence method Christian Wartena and I devised (see e.g. http://www.researchgate.net/profile/Christian_Wartena/publication/221466114_Instanced-Based_Mapping_between_Thesauri_and_Folksonomies/links/00b495187d1050d50c000000.pdf; sorry about the stupid typo in the title :-( ), or even older work with Anjo Anjewierden, Robert de Hoog, Lilia Efimova and myself (https://www.researchgate.net/publication/228341304_Detecting_knowledge_flows_in_weblogs).
Generally speaking, what you pick up is the fact that words tend to correlate if they are semantically related, and these correlations can be used as a proxy for the semantics themselves. You do have to realise that the proxy is rather crude: it is based on a bag-of-words model of language, it heavily depends on what documents you put in to learn the correlations (and therefore tends to do rather better when the documents have a clear focus), and finally it is not based on any world knowledge.
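As a rough illustration of that proxy (a hypothetical mini-corpus with made-up counts): two terms that keep occurring in the same documents end up with similar term vectors, regardless of any world knowledge:

```python
# Rough illustration with made-up bag-of-words counts: terms that
# co-occur across the same documents get similar row vectors, which
# is all the "semantics" this kind of model sees.
import numpy as np

terms = ["cat", "dog", "database"]
X = np.array([     # rows: terms, columns: documents
    [2, 1, 0, 0],  # "cat"
    [1, 2, 0, 0],  # "dog"
    [0, 0, 3, 1],  # "database"
], dtype=float)

def cosine(u, v):
    return u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

print(cosine(X[0], X[1]))  # cat vs dog: high, they co-occur
print(cosine(X[0], X[2]))  # cat vs database: 0, they never co-occur
```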
Yes, Latent Semantic Analysis can be used to derive semantic representations from large sets of text. Generally, you can create representations of individual words or of larger units of text (e.g., sentences, paragraphs, or whole documents). Typically one compares the vector representation of one unit of text to another (e.g., the similarity of "dog" to "cat", of a document to a query, or of one document to another). One place to get started if you want more of a feel for what you can do is the website LSA.colorado.edu, which lets you experiment with several semantic spaces. You might also look at Landauer, Foltz & Laham (1998), "An Introduction to Latent Semantic Analysis", for some of the basics.
As others have mentioned, there are considerations, such as which documents or corpus you use to train the system, that can affect the representation, and not all aspects of semantics are accurately captured in an LSA analysis. Your choice of tools depends on what you need to accomplish.
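For instance, a minimal sketch along those lines (assuming Python with scikit-learn; the corpus and query are made up, and this is of course not one of the pre-built spaces on LSA.colorado.edu) builds a small LSA space and compares a query to the documents in it:

```python
# Minimal sketch (assuming scikit-learn): comparing one unit of text to
# another in a small LSA space -- here a short query against documents.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD
from sklearn.metrics.pairwise import cosine_similarity
from sklearn.pipeline import make_pipeline

corpus = [
    "the dog chased the cat around the yard",
    "cats and dogs are common household pets",
    "the database server stores rows in tables",
    "postgresql is an open source relational database",
]

# TF-IDF weighting followed by a rank-2 SVD gives a small LSA space.
lsa = make_pipeline(TfidfVectorizer(), TruncatedSVD(n_components=2, random_state=0))
doc_vectors = lsa.fit_transform(corpus)

query_vector = lsa.transform(["my pet dog"])
print(cosine_similarity(query_vector, doc_vectors))  # pet docs should score higher
```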
The term-document matrix {X} is decomposed using SVD as follows:

{X} = {W}{S}{P}'

The decomposed matrices are then reduced to the first k dimensions, and the product of the reduced matrices is computed as the rank-k approximation {X̂}. The similarity between two terms is then calculated as the correlation between the corresponding term (row) vectors of {X̂}.
I have a question: can we calculate term similarity from the reduced {W} matrix alone? Does the {W} matrix give the same kind of relations between terms as {X̂}?
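A quick way to see the answer is algebraic: since {X̂}{X̂}' = {W_k}{S_k}²{W_k}', dot products (and hence cosine similarities) between term rows of {X̂} are identical to those between the rows of {W_k} scaled by the singular values {S_k}. The unscaled rows of {W_k} generally give different values, and the exact equivalence holds for cosine similarity rather than for Pearson correlation as used above. A minimal numpy check with a made-up matrix:

```python
# Minimal numpy check (toy, made-up term-document counts): cosine
# similarities between term rows of the rank-k reconstruction X_hat
# equal those between rows of W_k scaled by the singular values.
import numpy as np

X = np.array([
    [2., 1., 0., 0., 1.],
    [1., 2., 0., 1., 0.],
    [0., 0., 3., 1., 0.],
    [0., 1., 2., 2., 0.],
    [1., 0., 0., 2., 2.],
])

W, s, Pt = np.linalg.svd(X, full_matrices=False)  # X = W S P'
k = 2
X_hat = W[:, :k] @ np.diag(s[:k]) @ Pt[:k, :]     # rank-k approximation
T = W[:, :k] * s[:k]                              # term vectors: W_k S_k

def cos_rows(M):
    n = M / np.linalg.norm(M, axis=1, keepdims=True)
    return n @ n.T

print(np.allclose(cos_rows(X_hat), cos_rows(T)))          # True
print(np.allclose(cos_rows(X_hat), cos_rows(W[:, :k])))   # generally False
```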