AiLight - An Explanation

05/06/2020


Tl;dr: Most online summarizers cut out information. I've made AiLight, a web app that automatically highlights the summary sentences within PDFs, so that the important points stand out while all of the information is still viewable.



Reading assignments are common in secondary school and higher education. For topics you hate, they are essentially a minimax problem: how do you maximize your teacher's impression of your comprehension while minimizing the time spent reading? People usually conclude that the details are mostly frivolous and that the main ideas are what matter (i.e., what gets tested in a discussion or a quiz). The game then turns into extracting the main ideas of the article with minimal effort. One of the most interesting cheat codes for this problem is automatic summarization. People run the article through a summarizer, which returns a shorter version to read, cutting out anywhere from 70% to 90% of the information. While this may work great in theory, it comes at a price. If you only read the summary, you don't even glance at the details, and some of those details may show up on the test. So the only way to be safe is to read both the summary and the original text, which makes the summarizer pointless!


I personally faced this problem in my Psychology class, and created AiLight instead of reading the assigned papers. AiLight is an algorithm that highlights the important points within a paper, so that as I skim the highlights I can also glance at the surrounding details. It's a one-read version of automatic summarization.
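
To make the idea concrete, here is a minimal sketch of highlighting instead of cutting. It is not AiLight's actual code: the word-frequency scoring, the tiny stop-word list, the `[[ ]]` markers, and every function name below are assumptions made up for illustration, and it works on plain text rather than PDFs. The point it shows is that the summarizer returns positions in the original text rather than a shortened copy, so nothing is thrown away.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (illustration only).
STOP_WORDS = {"the", "a", "an", "of", "to", "and", "in", "is", "it",
              "that", "for", "on", "as", "with", "are", "was"}


def sentence_spans(text):
    """Return (start, end) character offsets for each sentence in text."""
    spans = []
    for m in re.finditer(r"\S[^.!?]*[.!?]*", text):
        end = m.start() + len(m.group().rstrip())  # drop trailing whitespace
        spans.append((m.start(), end))
    return spans


def content_words(s):
    """Lowercased words of s with stop words removed."""
    return [w for w in re.findall(r"[a-z']+", s.lower()) if w not in STOP_WORDS]


def highlight_spans(text, ratio=0.3):
    """Pick the top-scoring sentences and return their spans in reading order.

    A sentence's score is the average document-wide frequency of its
    content words (a simple extractive-summarization heuristic).
    """
    spans = sentence_spans(text)
    if not spans:
        return []
    freq = Counter(content_words(text))
    scored = []
    for start, end in spans:
        tokens = content_words(text[start:end])
        score = sum(freq[t] for t in tokens) / len(tokens) if tokens else 0.0
        scored.append((score, start, end))
    keep = max(1, round(len(spans) * ratio))
    top = sorted(scored, key=lambda item: item[0], reverse=True)[:keep]
    return sorted((start, end) for _, start, end in top)


def render(text, spans, open_mark="[[", close_mark="]]"):
    """Wrap each span in markers while leaving the rest of the text intact."""
    out, cursor = [], 0
    for start, end in spans:
        out.append(text[cursor:start])
        out.append(open_mark + text[start:end] + close_mark)
        cursor = end
    out.append(text[cursor:])
    return "".join(out)


if __name__ == "__main__":
    sample = ("Memory is reconstructive. The weather was nice. "
              "Memory errors show that memory is reconstructive. I had lunch.")
    print(render(sample, highlight_spans(sample, ratio=0.5)))
```

Because `render` only adds markers around the chosen spans, stripping the markers gives back the original text exactly, which is the property that separates highlighting from an ordinary summary.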


This solution ended up working better than I thought. Having the summary highlighted allowed me to see each important point in context. It turned the summary from a distillation into a roadmap, providing signage for each major point. This especially helped for longer papers with few paragraph breaks. I tend to remember where text is on the page and use that to refer back to the paper. In these kinds of papers it is harder to tell exactly where the desired information sits on the page, so learning positions in relation to the highlights makes the paper easier to dissect.


I decided to make the website public at https://anishthite.github.io/ailight, where anyone can use the service to highlight PDFs. The algorithm is far from perfect. I'm currently working on supporting more file types and languages, as well as improving efficiency and highlighting accuracy. If you use the service, let me know what you think!