CL/NLP (1)
COLING2020 (3)
DeepLearning (1)
Embeddings (2)
JointLearning (1)
LING120 (10)
MachineLearning (2)
May2020Notes (17)
NLP (26)
QG/QA/RC (5)
R (3)
Treebanking (2)
410X (2)
Teaching Notes- Text Mining in R, for non-programmers January 28, 2018
I teach a course “Language as Data” for introducing text processing and analysis methods to liberal arts majors. The course is a new experimental course firs...
Teaching Notes- First few weeks of text mining with R February 21, 2018
This post is a continuation from my previous post on teaching a course “Language as Data” for introducing text processing and analysis methods to liberal art...
CL/NLP (1)
A 1980s discussion in CL Journal on parsing ill-formed input March 9, 2018
Having 5 hours to kill in the airport, and not wanting to browse, I started checking the old issues of the Computational Linguistics journal, which was then calle...
COLING2020 (3)
Day1@COLING 2020 December 8, 2020
I am attending COLING 2020 and here are some notes about what I found interesting in today’s program. Note that I did not read them thoroughly. I just used t...
Day2@COLING 2020 December 9, 2020
I am attending COLING 2020 and here are some notes about what I found interesting in Day 2 (9th December 2020) program. Note that I did not read them thoroug...
Last 2 days @COLING 2020 December 10, 2020
I attended COLING 2020 two weeks ago and started blogging about day to day learnings. After 2 days, I gave up due to conference fatigue and some family situa...
DeepLearning (1)
Reading Notes - A Critical Appraisal of Deep Learning January 3, 2018
These are some notes I am making about the following article I came across on ArXiv.org:
Embeddings (2)
Reading Notes - Text Preprocessing in Neural Network Architectures July 7, 2017
I came across this short paper today in the ArXiv digest. This post is a quick summary.
Reading Notes - Evaluating the stability of embedding based similarities March 5, 2018
This post is a short summary of the following research paper
JointLearning (1)
Reading Notes - A Joint Segmentation and Classification Framework for Sentiment Analysis June 29, 2017
This is a reading summary of the following paper:
LING120 (10)
Teaching Notes - Teaching about encoding language on computers September 3, 2017
I teach a 100 level course called Language and Computers to a class of undergrads at all stages in their degree programs, and coming from diverse backgrounds...
Teaching Notes - Teaching about spelling and grammar correction September 10, 2017
This post is a continuation from my previous post on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a v...
Teaching Notes - Teaching about language tutoring systems September 17, 2017
This post is a continuation from my previous posts (Part 1, Part 2) on teaching a 100-level undergraduate course called Language and Computers. As mentioned e...
Teaching Notes - Teaching about search September 29, 2017
This post is a continuation from my previous posts - (Part 1, Part 2 and Part 3) on teaching a 100-level undergraduate course called Language and Computers. ...
Teaching Notes - Midterm presentations in 'Language and Computers' October 21, 2017
This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...
Teaching Notes - Teaching about 'What is NLP?' October 21, 2017
This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...
Teaching Notes - Text Classification October 29, 2017
This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...
Teaching Notes - Dialog Systems and Conversational Bots November 5, 2017
This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...
Teaching Notes - Speech Recognition and Synthesis November 11, 2017
This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...
Teaching Notes - Machine Translation December 23, 2017
This post is a continuation from my previous posts on teaching a 100-level undergraduate course called Language and Computers. As mentioned earlier, it is a ...
MachineLearning (2)
Reading Notes - Cost-Sensitive Learning vs. Sampling July 6, 2017
This is a quick summary of this short paper:
Reading Notes - Class Imbalance, Redux July 10, 2017
This post is a summary of the following paper
May2020Notes (17)
Quick Notes - Day 1 April 30, 2020
My parental leave ends in a month, so I was considering getting back into a regular reading habit. I felt committing myself to writing quick notes on...
Quick Notes - Day 2 May 1, 2020
All previous posts in this series here
Quick Notes - Day 3 May 2, 2020
All previous posts in this series here
Quick Notes - Day 4 May 3, 2020
All previous posts in this series here
Quick Notes - Day 5 May 4, 2020
All previous posts in this series here
Quick Notes - Day 6 May 5, 2020
All previous posts in this series here
Quick Notes - Day 7 May 6, 2020
All previous posts in this series here
Quick Notes - Day 8 May 7, 2020
All previous posts in this series here
Quick Notes - Day 9 May 8, 2020
All previous posts in this series here
Quick Notes - Day 10 May 9, 2020
All previous posts in this series here
Quick Notes - Day 11 May 10, 2020
All previous posts in this series here
Quick Notes - Day 12 May 11, 2020
All previous posts in this series here
Quick Notes - Day 13 May 12, 2020
All previous posts in this series here
Quick Notes - Day 14 May 13, 2020
All previous posts in this series here
Quick Notes - Day 15 May 14, 2020
All previous posts in this series here
Quick Notes - Day 16 May 15, 2020
All previous posts in this series here
Quick Notes - Day 19 May 18, 2020
All previous posts in this series here (Days 17 and 18 are included in one post with 4 papers)
NLP (26)
Reading Notes - Baselines and Bigrams January 5, 2018
Today, I came across a tweet by François Chollet:
Telugu Treebank in Universal Dependencies January 23, 2018
Several months back, in July 2017, I wrote about Universal Dependencies and wondered about not seeing anything for Telugu. Thanks to @phylostar, I ended up i...
Readings and Thoughts on deploying NLP/ML models June 28, 2018
Over the past few weeks, I spent quite a bit of time reading and thinking about the difference between academic and industry R&D for NLP and Machine Lear...
Keyphrase Extraction from Documents - 1 July 31, 2018
For the past few weeks, I have been working on automatic keyphrase extraction from documents. This post is the first in (hopefully) a series of posts to note...
Some thoughts on Tokenization approaches in NLP September 22, 2018
I was planning on writing more on Key phrase extraction and document tagging in NLP, but got occupied with work. In the meanwhile, I had a few discussions wi...
The problem with off-the-shelf Named Entity Recognizers November 13, 2018
A few months back, I wrote two posts about Named Entity Recognition (Part 1, Part 2), which is the language processing task that is concerned with identifyin...
Reading Notes - FastText embeddings and classification March 27, 2019
After a break of about 4 months, I feel compelled to write a blog post on some readings again! I was working with fastText over the past few weeks, as a part o...
Keyphrase Extraction from Documents - 2 April 2, 2019
Several months ago, I started writing on automatic keyphrase extraction, but couldn’t continue. I was building a keyphrase extractor for legal documents at t...
Quick Notes - Day 1 April 30, 2020
My parental leave ends in a month, so I was considering getting back into a regular reading habit. I felt committing myself to writing quick notes on...
Quick Notes - Day 2 May 1, 2020
All previous posts in this series here
Quick Notes - Day 3 May 2, 2020
All previous posts in this series here
Quick Notes - Day 4 May 3, 2020
All previous posts in this series here
Quick Notes - Day 5 May 4, 2020
All previous posts in this series here
Quick Notes - Day 6 May 5, 2020
All previous posts in this series here
Quick Notes - Day 7 May 6, 2020
All previous posts in this series here
Quick Notes - Day 8 May 7, 2020
All previous posts in this series here
Quick Notes - Day 9 May 8, 2020
All previous posts in this series here
Quick Notes - Day 10 May 9, 2020
All previous posts in this series here
Quick Notes - Day 11 May 10, 2020
All previous posts in this series here
Quick Notes - Day 12 May 11, 2020
All previous posts in this series here
Quick Notes - Day 13 May 12, 2020
All previous posts in this series here
Quick Notes - Day 14 May 13, 2020
All previous posts in this series here
Quick Notes - Day 15 May 14, 2020
All previous posts in this series here
Quick Notes - Day 16 May 15, 2020
All previous posts in this series here
REPROLANG 2020 Language Proficiency Scoring Task - A review May 16, 2020
In 2018, Taraka Rama and I published a paper “Experiments with Universal CEFR Classification”. At that time, we shared the paper’s code, along with the result...
Quick Notes - Day 19 May 18, 2020
All previous posts in this series here (Days 17 and 18 are included in one post with 4 papers)
QG/QA/RC (5)
Reading Notes - A Joint Model for Question Answering and Question Generation June 23, 2017
This post is my quick summary of what I understood from an article I read, by a team from Microsoft Maluuba.
Reading Notes - Question Answering and Question Generation as Dual Tasks June 23, 2017
I have been reading about automatic question generation on and off for about a year now. There has been an increase in stuff that is coming up since last ye...
Reading Notes - Active comprehension (1982 article) June 27, 2017
This post is my quick note-taking about the following 1982 article: Singer, Harry and Donlan, Dan. Active comprehension: Problem-solving schema wit...
Reading Notes - Ranking Automatically Generated Questions Using Common Human Queries June 27, 2017
This post is about the following short paper:
Reading Notes - Generating Natural Language Questions to Support Learning On-Line June 27, 2017
Generating Natural Language Questions to Support Learning On-Line
R (3)
Exploratory Factor Analysis in SPSS vs R October 8, 2017
I got interested in Exploratory Factor Analysis (EFA) recently, thanks to some of the students with whom I work right now. They come from a background of sta...
Teaching Notes- Text Mining in R, for non-programmers January 28, 2018
I teach a course “Language as Data” for introducing text processing and analysis methods to liberal arts majors. The course is a new experimental course firs...
Teaching Notes- First few weeks of text mining with R February 21, 2018
This post is a continuation from my previous post on teaching a course “Language as Data” for introducing text processing and analysis methods to liberal art...
Treebanking (2)
Universal Dependencies for several languages July 22, 2017
Thanks to the recent discussions with @phylostar, I have been reading quite a bit on Universal Dependencies(UD) and on the creation of manual annotations fol...
Telugu Treebank in Universal Dependencies January 23, 2018
Several months back, in July 2017, I wrote about Universal Dependencies and wondered about not seeing anything for Telugu. Thanks to @phylostar, I ended up i...