Sentiment Analysis

Sentiment analysis is one of the oldest tasks in natural language processing. In this note we will introduce some examples and terminology, some key problems in the field and a simple model that we can understand by just knowing Backpropagation Log Linear Models and the Softmax…

Log Linear Models

Log Linear Models can be considered the most basic model used in natural languages. The main idea is to try to model the correlations of our data, or how the posterior p ( y ∣ x ) varies, where x is our single data point features and y are the labels of interest. This is a form…

Anomaly Detection

Anomaly detection is a problem in machine learning that is of a big interest in industry. For example a bank needs to identify problems in transactions, doctors need it to see illness, or suspicious behaviors for law (no Orwell here). The main difference between this and…

October 30, 2024 · Reading Time: 2 minutes · By Xuanqiang Angelo Huang

Language Models

In order to understand language models we need to understand structured prediction . If you are familiar with Sentiment Analysis , where given an input text we need to classify it in a binary manner, in this case the output space usually scales in an exponential manner. The…

Part of Speech Tagging

What is a part of Speech? # A part of speech (POS) is a category of words that display similar syntactic behavior , i.e., they play similar roles within the grammatical structure of sentences. It has been known since the Latin era that some categories of words behave similarly…

Softmax Function

Softmax is one of the most important functions for neural networks. It also has some interesting properties that we list here. This function is part of The Exponential Family , one can also see that the sigmoid function is a particular case of this softmax, just two variables.…

The Exponential Family

This is the generalization of the family of function where Softmax Function belongs. Many many functions are part of this family, most of the distributions that are used in science are part of the exponential family, e.g. beta, Gaussian, Bernoulli, Categorical distribution,…

Dependency Parsing

This set of note is still in TODO Dependency Grammar has been much bigger in Europe compared to USA, where Chomsky's grammars ruled. One of the main developers of this theory is Lucien Tesnière (1959): “The sentence is an organized whole, the constituent elements of which are…

Accessibilità

Ci chiediamo come facciamo a rendere sistemi informatici accessibili a persone attraverso certe tecnologie. Slide esempi di disabilità È meglio renderlo accessibile perché è illegale (nel senso che stai facendo una discriminazione verso un certo insieme di persone). WGAC #…

Ambienti di sviluppo

Ambienti di sviluppo # Ambiente di sviluppo è diverso rispetto all’ambiente di deploy! bisognare fare delle differenze, sono dell macchine diverse, in questa sezione di documenti andiamo a parlare di norme e modi di lavorare per facilitare il metodo di sviluppo. Note di…