How to improve search on ?

The goal for any content driven website is to provide its users with relevant content based on Information Discovery. Most of the websites choose ‘search’ as a tool for information discovery (Netflix is a mind blowing exception to this) and in search, relevance is heavily dependent on context rather than just strings present in them. Not many websites realise this and I was surprised to see that , known for having one of the best content repositories,  giving barely useful results for my search queries.

Now, I am a happy subscriber of Harvard Business Review and have been using it often to enhance my limited knowledge by discovering interesting content around technology, business, leadership etc. and have absolutely no doubt on the high quality of its content and really great contributors. But, I am disappointed at how that content is becoming difficult to discover in the first place. Here is how its search looks like: search

All looks well, it has a  very familiar interface, like most of the other sites have and the first result does have the word I searched for “search”. Don’t go away yet, stay with me till the end 🙂

In my quest to learn more and follow everything about Artificial Intelligence I keep looking everywhere for anything related to AI. I thought let’s try what all is there in repository on AI. I search first for artificial intelligence and look at the results:

First result is a “Sales and Marketing” case study about an early stage company Empathetics (an organization that teaches empathy to healthcare professionals and staff to improve the patient experience) and I wonder why is that the top result (when sorted by relevance!) for what I searched for. I open it, scratch my head really hard to figure out what exactly is related to AI there but could not find anything. I scroll down and to my despair, the other results are also out of the world for me. Here they are:


Then I thought of trying the ‘exactness’ trick – I searched for “Artificial Intelligence” and boom!

ZERO results

ZERO results! Apparently it does not support exact string search, which ideally it should. I knew this is simply not true as does have articles on AI. Examples:

It gives irrelevant results in search even though the right content is very much there in repository.

Moving on, I tried taking my chances on AI – I search for ‘AI’. Here is what happens:

Do you notice it? There is an author Ai-Ling Jamila Malone whose name contains “AI” and simply is showing me all articles from the author. This means it is giving a higher score (probably) to words found in a wrong field (author field)  than the content itself and that too without any context.

Now it could have been deliberately done assuming most of the people want to search for names of authors but hey, that use case CAN be handled in a better way.

Moving further, I check for another hot topic – Deep Learning – and default (sorted by relevance) results appear to be relevant (see now it shows me results for artificial intelligence as well).

But the articles are old and I needed the latest ones – the moment I try sorting by publication date I see this:

the first result is this – – which may seem somewhat relevant given it has AI in its title but it is not even remotely related to “Deep Learning” and the only reason it appears in search results is because there is a word “deep” in one paragraph somewhere

and there is another paragraph with the word “learning” somewhere.

The other results are same –  — this tops the chart of weirdness for me.

they have the words “deep” and “learning” somewhere and hence it is being shown. An important point to consider – I want latest but still relevant results and fails at it. It does not identify ‘deep learning’ as a concept made of two words and is simply looking up the words appearing somewhere in the content.

The root causes for all of it can be summarised as follows:

  1. The search at is still relying on basic keyword based scoring and has no Ontology of concepts like “Artificial Intelligence” or “Deep Learning” or the relationships between the concepts.
  2. It does not account for synonyms and hence is unable to understand that “Artificial Intelligence” and “AI” are same concepts. An Ontology makes it much easier to maintain all synonyms of any given concept.
  3. It does not identify entities so is unable to differentiate between name of a person “Ai-Ling Jamila Malone” and a concept “AI”

Based on how we have designed information discovery for our Data as a Service (DaaS) platform I can say that the primary reason for all of the problems is the missing Ontology leading to missing Entity Recognition and disambiguation.

Search is important but in itself is not always the best way for information discovery. Users don’t get happy at seeing millions of search results for what they search – that only adds to information overload. What matters is, if you are telling me there are so many possible results out there, then tell me how they are distributed across different dimensions around my ‘interest’. Let me choose a direction and don’t force me to keep going through all the results in a linear fashion – nobody will live through to get to the end of the millionth result page. And not just Innoplexus but I know there are few other companies out there who are following this philosophy and making Information Discovery easier for their users. One of the examples I quoted above as well is Netflix.

True information discovery tool has to be a Digital Gyroscope helping one to explore the Data Universe by giving a sense of all possible directions and saving one from getting lost in the hyperspace.

Thanks to Ravi Ranjan for reviewing the article and helping with the headline 🙂

So you thought that Bitcoin was not being manipulated ?

Slow clap for all of those who thought Bitcoin (or other crypto currencies) were / are not being manipulated. Techcrunch reported on 15th Jan and MIT Technology Review reported on 16th Jan about a research which gives insights into how Bitcoin was being manipulated by two bots named Markus and Willy – in 2013. They performed valid trades but did not actually own any bitcoin themselves.

Entitled “Price Manipulation in the Bitcoin Ecosystem” and appearing in the recent issue of the Journal of Monetary Economics the paper describes to what degree the Bitcoin ecosystem is controlled by bad actors. Based on rigorous analysis with extensive robustness checks, the paper demonstrates that the suspicious trading activity likely caused the unprecedented spike in the USD-BTC exchange rate in late 2013, when the rate jumped from around $150 to more than $1,000 in two months.”

Now you can imagine if this happened in 2013 then what is happening today. For all its talk about decentralisation, the fact that it is so susceptible to such manipulations, should be a warning for everybody trying to ‘try their luck’ in Bitcoin or other crypto currencies.



Find a vision

Have you found a vision you identify with ?  At Innoplexus we have found a vision which we all believe in and are working together to bring it to reality.  This video by Simon Sinek   really puts it beautifully – not everyone needs to be a visionary. It’s just as inspiring to follow a vision that resonates with us as it is to have our own. Because it’s the followers, not the visionary, who bring the vision to life. It’s our responsibility to find a leader, find a company, find a vision, that we believe in and work to help build. 

Bill Gates on geeks

If your culture doesn’t like geeks, you are in real trouble. @Bill Gates said that in 2010. (Not) surprisingly, most of the companies still have not understood it. Look at the world’s most valuable companies – they know that for sure and made a strategy around that only. 

Here is the article where Bill Gates made that statement – Bill Gates: ‘If you don’t like geeks, you’re in trouble.’

Looking at the world’s best restaurants for Innovation

I came across this article on HBR today which looks at the world’s best restaurants to find out how they balance Innovation and Consistency.

Here’s the summary :

Despite being able to charge hundreds of dollars for a meal and being fully booked months in advance, top restaurants often still have a hard time turning a profit. And they face an added challenge of maintaining flawless consistency, while simultaneously being innovative and cutting-edge. This requires dedicated time and space for research and experimentation, as well as a thorough process for both iterating on and standardizing new inventions. Examples of restaurants that have made  both the Michelin Guide and 50 Best Restaurants of the World list show how they encourage creativity and learning beyond the leadership or lab teams, and generate, refine, and standardize ideas.

The original article is here –

All the projects follow a specific development process, alternating between collective ideation or feedback and focused work by a small team. For restaurant dishes, the development team will quickly prototype and iterate through numerous versions of the dish and its components, either in the lab or if a lab is not available, in the main kitchen during slow hours. The trials can go over for months as numerous variations are tested in a race against seasonal ingredients.

This just goes on to prove that Consistency and Creativity are not mutually exclusive – restaurants need to be both at the same time. Innovation which is not a result of a set process will  soon become unsustainable – one may be able to innovate even without process at times but no one can repeat the feat a number of times. That consistency is possible only with a process.


Entrepreneur vs Manager

What differentiates an Entrepreneur from a Manager ? While I still do not have a definite answer to it, I do have an anecdote, shared by one of my mentors few years back.

A group of lumberjacks entered a forest with their manager to carry out a survey on how much wood log will they be able to generate from there. Few of them gave the feedback to manager that most of the trees they found till that time did not have good quality and the trunks were not all straight. The manager asked them to extend the area of survey.

They went back and carried on. Two days passed by but the results were not encouraging. Third day, manager’s boss, the owner of the small company (the entrepreneur) reached there and asked about the progress. The manager presented the report with number of trees his team had surveyed, how many trees were of good quality, how many hours did his team spend in surveying etc. along with explaining that he has now extended the survey area to find more trees with better quality. He listened intently and asked his manager if he already has an idea of the tallest tree his team has found in the surveyed area. Manager, beaming with a sense of achievement, said yes and offered to take him to the tree.

Once the boss reached there, he surprised everybody by saying he wants to climb the tree. Manager said that it is a bit risky, that he should reconsider it and maybe ask one of the team members to climb instead. He refused and started climbing. Atop, he spent about five minutes looking around in all directions, came down and asked the manager “Call everybody back, we are leaving”. The manager was dumbfounded, saying we have invested so many days already, maybe we should wait till we find better trees, that the forest is so big. Owner simply replied “Wrong forest!”.

Originally published my me on Linkedin

नव वर्ष: समय

न यह अंत है, न यह आरंभ है
न यह नवीन है, न यह प्राचीन है
न यह स्वाधीन है, न यह पराधीन है

आकांक्षा भी यही और अनल भी
कृष्ण भी यही और कंस भी
राम भी यही और रावण भी

पुष्प भी यही है और भ्रमर भी यही है
अपरिमित, अजान, अबूझ, अनघ एक यही है
यह समय है, यह बस समय है

न यह गया है और न यह आने वाला है
आज है, अभी है, इसी क्षण है
यह बस है, यह बस है

इसी अनुभूति के साथ नव वर्ष २०१८ की शुभकामनायें |