Posts by Tags

410X (2)
CL/NLP (1)
COLING2020 (3)
DeepLearning (1)
Embeddings (2)
JointLearning (1)
LING120 (10)
MachineLearning (2)
May2020Notes (17)
NLP (26)
QG/QA/RC (5)
R (3)
Treebanking (2)

410X (2)

Teaching Notes- Text Mining in R, for non-programmers

January 28, 2018

I teach a course “Language as Data” for introducing text processing and analysis methods to liberal arts majors. The course is a new experimental course firs...

Teaching Notes- First few weeks of text mining with R

February 21, 2018

This post is a continuation from my previous post on teaching a course “Language as Data” for introducing text processing and analysis methods to liberal art...

CL/NLP (1)

A 1980s discussion in CL Journal on parsing ill-formed input

March 9, 2018

Having 5 hours to kill in the airport, and not wanting to browse, I started checking the old issues of Computational Liguistics journal, which was then calle...

COLING2020 (3)

Day1@COLING 2020

December 8, 2020

I am attending COLING 2020 and here are some notes about what I found interesting in today’s program. Note that I did not read them thoroughly. I just used t...

Day2@COLING 2020

December 9, 2020

I am attending COLING 2020 and here are some notes about what I found interesting in Day 2 (9th December 2020) program. Note that I did not read them thoroug...

Last 2 days @COLING 2020

December 10, 2020

I attended COLING 2020 two weeks ago and started blogging about day to day learnings. After 2 days, I gave up due to conference fatigue and some family situa...

DeepLearning (1)

Reading Notes - A Critical Appraisal of Deep Learning

January 3, 2018

This are some notes I am making about the following article I came across on ArXiv.org:

Embeddings (2)

Reading Notes - Text Preprocessing in Neural Network Architectures

July 7, 2017

I came across this short paper today in the ArXiv digest. This post is a quick summary.

Reading Notes - Evaluating the stability of embedding based similarities

March 5, 2018

This post is a short summary of the following research paper

JointLearning (1)

Reading Notes - A Joint Segmentation and Classification Framework for Sentiment Analysis

June 29, 2017

This is a reading summary of the following paper:

LING120 (10)

Teaching Notes - Teaching about encoding language on computers

September 3, 2017

I teach a 100 level course called Language and Computers to a class of undergrads at all stages in their degree programs, and coming from diverse backgrounds...

Teaching Notes - Teaching about spelling and grammar correction

September 10, 2017

This post is a continuation from my previous post on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a v...

Teaching Notes - Teaching about language tutoring systems

September 17, 2017

This post is a continuation from my previous posts - Part 1 Part 2 on teaching a 100-level undergraduate course called Language and Computers. As mentioned e...

Teaching Notes - Teaching about search

September 29, 2017

This post is a continuation from my previous posts - (Part 1, Part 2 and Part 3) on teaching a 100-level undergraduate course called Language and Computers. ...

Teaching Notes - Midterm presentations in 'Language and Computers'

October 21, 2017

This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...

Teaching Notes - Teaching about 'What is NLP?'

October 21, 2017

This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...

Teaching Notes - Text Classification

October 29, 2017

This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...

Teaching Notes - Dialog Systems and Conversational Bots

November 5, 2017

This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...

Teaching Notes - Speech Recognition and Synthesis

November 11, 2017

This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...

Teaching Notes - Machine Translation

December 23, 2017

This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...

MachineLearning (2)

Reading Notes - Cost-Sensitive Learning vs. Sampling

July 6, 2017

This is a quick summary of this short paper:

Reading Notes - Class Imbalance, Redux

July 10, 2017

This post is a summary of the following paper

May2020Notes (17)

Quick Notes - Day 1

April 30, 2020

My parental leave ends in a month, and so, I was considering getting back into some regular reading habit. I felt committing myself to writing quick notes on...

Quick Notes - Day 2

May 1, 2020

All previous posts in this series here

Quick Notes - Day 3

May 2, 2020

All previous posts in this series here

Quick Notes - Day 4

May 3, 2020

All previous posts in this series here

Quick Notes - Day 5

May 4, 2020

All previous posts in this series here

Quick Notes - Day 6

May 5, 2020

All previous posts in this series here

Quick Notes - Day 7

May 6, 2020

All previous posts in this series here

Quick Notes - Day 8

May 7, 2020

All previous posts in this series here

Quick Notes - Day 9

May 8, 2020

All previous posts in this series here

Quick Notes - Day 10

May 9, 2020

All previous posts in this series here

Quick Notes - Day 11

May 10, 2020

All previous posts in this series here

Quick Notes - Day 12

May 11, 2020

All previous posts in this series here

Quick Notes - Day 13

May 12, 2020

All previous posts in this series here

Quick Notes - Day 14

May 13, 2020

All previous posts in this series here

Quick Notes - Day 15

May 14, 2020

All previous posts in this series here

Quick Notes - Day 16

May 15, 2020

All previous posts in this series here

Quick Notes - Day 19

May 18, 2020

All previous posts in this series here (Days 17 and 18 are included in one post with 4 papers)

NLP (26)

Reading Notes - Baselines and Bigrams

January 5, 2018

Today, I came across a tweet by François Chollet:

Telugu Treebank in Universal Dependencies

January 23, 2018

Several months back, in July 2017, I wrote about Universal Dependencies and wondered about not seeing anything for Telugu. Thanks to @phylostar, I ended up i...

Readings and Thoughts on deploying NLP/ML models

June 28, 2018

Over the past few weeks, I spent quite a bit of time reading and thinking about the difference between academic and industry R&D for NLP and Machine Lear...

Keyphrase Extraction from Documents - 1

July 31, 2018

For the past few weeks, I have been working on automatic keyphrase extraction from documents. This post is the first in (hopefully) a series of posts to note...

Some thoughts on Tokenization approaches in NLP

September 22, 2018

I was planning on writing more on Key phrase extraction and document tagging in NLP, but got occupied with work. In the meanwhile, I had a few discussions wi...

The problem with off-the-shelf Named Entity Recognizers

November 13, 2018

A few months back, I wrote two posts about Named Entity Recognition (Part 1, Part 2), which is the language processing task that is concerned with identifyin...

Reading Notes - FastText embeddings and classification

March 27, 2019

After about 4 months of break, I feel compelled to write a blog post on some readings again! I was working with fastText over the past few weeks, as a part o...

Keyphrase Extraction from Documents - 2

April 2, 2019

Several months ago, I started writing on automatic keyphrase extraction, but couldn’t continue. I was building a keyphrase extractor for legal documents at t...

Quick Notes - Day 1

April 30, 2020

My parental leave ends in a month, and so, I was considering getting back into some regular reading habit. I felt committing myself to writing quick notes on...

Quick Notes - Day 2

May 1, 2020

All previous posts in this series here

Quick Notes - Day 3

May 2, 2020

All previous posts in this series here

Quick Notes - Day 4

May 3, 2020

All previous posts in this series here

Quick Notes - Day 5

May 4, 2020

All previous posts in this series here

Quick Notes - Day 6

May 5, 2020

All previous posts in this series here

Quick Notes - Day 7

May 6, 2020

All previous posts in this series here

Quick Notes - Day 8

May 7, 2020

All previous posts in this series here

Quick Notes - Day 9

May 8, 2020

All previous posts in this series here

Quick Notes - Day 10

May 9, 2020

All previous posts in this series here

Quick Notes - Day 11

May 10, 2020

All previous posts in this series here

Quick Notes - Day 12

May 11, 2020

All previous posts in this series here

Quick Notes - Day 13

May 12, 2020

All previous posts in this series here

Quick Notes - Day 14

May 13, 2020

All previous posts in this series here

Quick Notes - Day 15

May 14, 2020

All previous posts in this series here

Quick Notes - Day 16

May 15, 2020

All previous posts in this series here

REPROLANG 2020 Language Proficiency Scoring Task - A review

May 16, 2020

In 2018, Taraka Rama and I published a paper“Experiments with Universal CEFR Classification”. At that time, we shared the paper’s code, along with the result...

Quick Notes - Day 19

May 18, 2020

All previous posts in this series here (Days 17 and 18 are included in one post with 4 papers)

QG/QA/RC (5)

Reading Notes - A Joint Model for Question Answering and Question Generation

June 23, 2017

This post is my quick summary of what I understood an article I read, by a team from Microsoft Maluuba.

Reading Notes - Question Answering and Question Generation as Dual Tasks

June 23, 2017

I have been reading about automatic question generation on and off for about an year now. There has been an increase in stuff that is coming up since last ye...

Reading Notes - Active comprehension (1982 article)

June 27, 2017

This post is my quick note-taking about the following 1982 article: Singer, Harry and Donlan, Dan. Active comprehension: Problem-solving schema wit...

Reading Notes - Ranking Automatically Generated Questions Using Common Human Queries

June 27, 2017

This post is about the following short paper:

Reading Notes - Generating Natural Language Questions to Support Learning On-Line

June 27, 2017

Generating Natural Language Questions to Support Learning On-Line

R (3)

Exploratory Factor Analysis in SPSS vs R

October 8, 2017

I got interested in Exploratory Factor Analysis (EFA) recently, thanks to some of the students with whom I work right now. They come from a background of sta...

Teaching Notes- Text Mining in R, for non-programmers

January 28, 2018

I teach a course “Language as Data” for introducing text processing and analysis methods to liberal arts majors. The course is a new experimental course firs...

Teaching Notes- First few weeks of text mining with R

February 21, 2018

This post is a continuation from my previous post on teaching a course “Language as Data” for introducing text processing and analysis methods to liberal art...

Treebanking (2)

Universal Dependencies for several languages

July 22, 2017

Thanks to the recent discussions with @phylostar, I have been reading quite a bit on Universal Dependencies(UD) and on the creation of manual annotations fol...

Telugu Treebank in Universal Dependencies

January 23, 2018

Several months back, in July 2017, I wrote about Universal Dependencies and wondered about not seeing anything for Telugu. Thanks to @phylostar, I ended up i...