Posts by Category

Readings (16)
Research (5)
Talks (2)
Teaching (12)
WorkingWithData (4)

Readings (16)

Reading Notes - A Joint Model for Question Answering and Question Generation

June 23, 2017

This post is my quick summary of what I understood from an article I read, by a team from Microsoft Maluuba.

Reading Notes - Question Answering and Question Generation as Dual Tasks

June 23, 2017

I have been reading about automatic question generation on and off for about a year now. There has been an increase in stuff that is coming up since last ye...

Reading Notes - Active comprehension (1982 article)

June 27, 2017

This post is my quick note-taking about the following 1982 article: Singer, Harry and Donlan, Dan. Active comprehension: Problem-solving schema wit...

Reading Notes - Ranking Automatically Generated Questions Using Common Human Queries

June 27, 2017

This post is about the following short paper:

Reading Notes - Generating Natural Language Questions to Support Learning On-Line

June 27, 2017

Generating Natural Language Questions to Support Learning On-Line

Reading Notes - A Joint Segmentation and Classification Framework for Sentiment Analysis

June 29, 2017

This is a reading summary of the following paper:

Reading Notes - Cost-Sensitive Learning vs. Sampling

July 6, 2017

This is a quick summary of this short paper:

Reading Notes - Text Preprocessing in Neural Network Architectures

July 7, 2017

I came across this short paper today in the ArXiv digest. This post is a quick summary.

Reading Notes - Class Imbalance, Redux

July 10, 2017

This post is a summary of the following paper

Universal Dependencies for several languages

July 22, 2017

Thanks to the recent discussions with @phylostar, I have been reading quite a bit on Universal Dependencies (UD) and on the creation of manual annotations fol...

Reading Notes - A Critical Appraisal of Deep Learning

January 3, 2018

These are some notes I am making about the following article I came across on ArXiv.org:

Reading Notes - Baselines and Bigrams

January 5, 2018

Today, I came across a tweet by François Chollet:

Reading Notes - Evaluating the stability of embedding based similarities

March 5, 2018

This post is a short summary of the following research paper

A 1980s discussion in CL Journal on parsing ill-formed input

March 9, 2018

Having 5 hours to kill in the airport, and not wanting to browse, I started checking the old issues of Computational Linguistics journal, which was then calle...

Reading Notes - how neural language models use context

May 20, 2018

This post is my comments/notes about the following paper:

Readings and Thoughts on deploying NLP/ML models

June 28, 2018

Over the past few weeks, I spent quite a bit of time reading and thinking about the difference between academic and industry R&D for NLP and Machine Lear...

Research (5)

Telugu Treebank in Universal Dependencies

January 23, 2018

Several months back, in July 2017, I wrote about Universal Dependencies and wondered about not seeing anything for Telugu. Thanks to @phylostar, I ended up i...

Thoughts and Experiments with NER-1

June 2, 2018

I have been spending quite some time in the past 2-3 weeks reading and working on Named Entity Recognition aka NER, which is a sub-problem of information extr...

Thoughts and Experiments with NER-2

June 14, 2018

In the last post, I wrote about the general idea of Named Entity Recognition, the issues I noticed in the way researchers discuss the problem in contemporary...

Keyphrase Extraction from Documents - 1

July 31, 2018

For the past few weeks, I have been working on automatic keyphrase extraction from documents. This post is the first in (hopefully) a series of posts to note...

The problem with off-the-shelf Named Entity Recognizers

November 13, 2018

A few months back, I wrote two posts about Named Entity Recognition (Part 1, Part 2), which is the language processing task that is concerned with identifyin...

Talks (2)

Talk on Research Methods in Computational Linguistics

March 7, 2018

I gave a guest lecture today in a research methods class comprising graduate students from Rhetoric and Professional Communication. When Prashant Rajan...

Tutorial on NLP with Python

May 18, 2018

I gave a tutorial at the Mid-west Big Data Summer School 2018, held at Iowa State University, USA. It is titled “Natural Language Processing with Python” and...

Teaching (12)

Teaching Notes - Teaching about encoding language on computers

September 3, 2017

I teach a 100 level course called Language and Computers to a class of undergrads at all stages in their degree programs, and coming from diverse backgrounds...

Teaching Notes - Teaching about spelling and grammar correction

September 10, 2017

This post is a continuation from my previous post on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a v...

Teaching Notes - Teaching about language tutoring systems

September 17, 2017

This post is a continuation from my previous posts (Part 1, Part 2) on teaching a 100-level undergraduate course called Language and Computers. As mentioned e...

Teaching Notes - Teaching about search

September 29, 2017

This post is a continuation from my previous posts - (Part 1, Part 2 and Part 3) on teaching a 100-level undergraduate course called Language and Computers. ...

Teaching Notes - Midterm presentations in 'Language and Computers'

October 21, 2017

This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...

Teaching Notes - Teaching about 'What is NLP?'

October 21, 2017

This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...

Teaching Notes - Text Classification

October 29, 2017

This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...

Teaching Notes - Dialog Systems and Conversational Bots

November 5, 2017

This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...

Teaching Notes - Speech Recognition and Synthesis

November 11, 2017

This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...

Teaching Notes - Machine Translation

December 23, 2017

This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...

Teaching Notes - Text Mining in R, for non-programmers

January 28, 2018

I teach a course “Language as Data” for introducing text processing and analysis methods to liberal arts majors. The course is a new experimental course firs...

Teaching Notes - First few weeks of text mining with R

February 21, 2018

This post is a continuation from my previous post on teaching a course “Language as Data” for introducing text processing and analysis methods to liberal art...

WorkingWithData (4)

Notes on Kaggle competitions

June 30, 2017

I participated in my first Kaggle competition this month (Sberbank Russian Housing Market) along with @embedsri, and this post is a reflection on that.

Exploratory Factor Analysis in SPSS vs R

October 8, 2017

I got interested in Exploratory Factor Analysis (EFA) recently, thanks to some of the students with whom I work right now. They come from a background of sta...

Thoughts and Experiments with NER-2

June 14, 2018

In the last post, I wrote about the general idea of Named Entity Recognition, the issues I noticed in the way researchers discuss the problem in contemporary...

Some thoughts on Tokenization approaches in NLP

September 22, 2018

I was planning on writing more on Key phrase extraction and document tagging in NLP, but got occupied with work. In the meantime, I had a few discussions wi...