If I understood correctly, you have two sets of words that you want to compare. What I suggest is: first, establish the distribution each set follows, and second, compare the resulting distributions using the Kullback-Leibler (KL) divergence. The first step can be done at the token or word level, depending on your goal. To do it at the word level you will need a large sample of words and an index of each word. If you happen to use Python, the "collections" module makes it easy to compute word frequencies (see for example the attached link), and there are also Python modules for entropy calculations.
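In case it helps, here is a minimal sketch of those two steps in Python; the sample texts and the smoothing constant are made up for illustration. It uses `collections.Counter` for the word frequencies and `scipy.stats.entropy`, which computes the Shannon entropy of one distribution and the KL divergence when given two.

```python
from collections import Counter
from scipy.stats import entropy

# Hypothetical word samples standing in for your two sets of words.
sample_a = "the cat sat on the mat".split()
sample_b = "the dog sat on the log".split()

counts_a = Counter(sample_a)   # word -> frequency
counts_b = Counter(sample_b)

# Put both distributions on a common vocabulary so they share the same support,
# and add a small smoothing constant so the KL divergence stays finite.
vocab = sorted(set(counts_a) | set(counts_b))
eps = 1e-9
p = [counts_a[w] + eps for w in vocab]
q = [counts_b[w] + eps for w in vocab]

print(entropy(p))     # Shannon entropy of the first distribution (normalised internally)
print(entropy(p, q))  # Kullback-Leibler divergence D(p || q)
```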
I am not sure I understand your question fully. Can you elaborate a bit? As I understand it, you want to compare distributions over different alphabets, but the KL divergence is defined for two distributions over the same alphabet.
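For reference, the standard definition shows why: both $P$ and $Q$ must assign probabilities to the same alphabet $\mathcal{X}$, since the sum runs over a single set of symbols,

$$D_{\mathrm{KL}}(P \,\|\, Q) = \sum_{x \in \mathcal{X}} P(x) \log \frac{P(x)}{Q(x)}.$$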
If you want to compare different distributions, the KL divergence is one option. However, its asymmetry makes it a bit hard to work with. There are other alternatives; from an information-theoretic point of view I would point to the Jensen-Shannon divergence, see e.g.
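In case a concrete computation helps, below is a small sketch of the Jensen-Shannon divergence in Python, built on `scipy.stats.entropy`; the two example distributions are made up. Unlike KL, it is symmetric and bounded (by log 2 in nats), because it measures each distribution against their average.

```python
import numpy as np
from scipy.stats import entropy

def js_divergence(p, q):
    """Jensen-Shannon divergence: average KL of p and q against their mixture m."""
    p = np.asarray(p, dtype=float) / np.sum(p)
    q = np.asarray(q, dtype=float) / np.sum(q)
    m = 0.5 * (p + q)
    return 0.5 * entropy(p, m) + 0.5 * entropy(q, m)

p = [0.1, 0.4, 0.5]          # made-up distributions for illustration
q = [0.3, 0.3, 0.4]
print(js_divergence(p, q))   # same value as js_divergence(q, p)
```

SciPy also ships `scipy.spatial.distance.jensenshannon`, which returns the square root of this quantity (the Jensen-Shannon distance, which is a proper metric).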