The Business Intelligence Blog

Slicing Business Dicing Intelligence

Archive for the ‘Data Mining’ tag

MS in Verticals – Buys Predictive Analytics company, Farecast  

Seattle Pi’s Venture Blog has the full story from the start to the end.

Farecast was started by University of Washington computer scientist Oren Etzioni, initially bankrolled by Madrona, built with people from local companies such as Alaska Airlines and AdRelevance and, ultimately, acquired by Microsoft.

Though Farecast had multiple bidders, McIlwain said Microsoft was a good fit since the two companies had worked together in the past and had a similar vision for online search. The proximity of the two companies also played a part, he said.

The acquisition follows the merger of Kayak.com and SideStep, the market leader in next generation travel search. That deal led to new opportunities for Farecast, including discussions with Microsoft which heated up in the past 90 days.

“That consolidation presented opportunities for Farecast … partly differentiated because of their predictive capabilities but also because of who they might have been able to align with in the industry to be a strong and differentiated number two, hoping some day to overtake and become number one,” he said.

Madrona has produced a number of hits recently, with the sales of ShareBuilder, World Wide Packets and iConclude.

Also a quick analysis from Motel Fool on this buy -

Microsoft needs more deals like this one, especially if the Microhoo deal comes undone, and the software giant has the means to go shopping. I’ve suggested that Microsoft pursue potential buyout candidates like The Knot (Nasdaq: KNOT) and Bankrate (Nasdaq: RATE) for the same reason that Farecast works. Whether it’s wedding planning, home refinancing, or booking that flight to visit your parents in Chicago, this is the quality traffic that Microsoft and Yahoo! lack right now.

The article has

no responses yet

Written by Guru Kirthigavasan

April 18th, 2008 at 8:41 am

Stanford students working on Netflix Algorithms  

Anand Rajaraman, the co-founder of Kosmix also teaches Data Mining at Stanford. Here’s an interesting note from his blog.

Some of his students are working to crack algorithms for the on-going Netflix “Better Recommendation Logic” Prize of $1 million. Read it !!

Here’s how the competition works. Netflix has provided a large data set that tells you how nearly half a million people have rated about 18,000 movies. Based on these ratings, you are asked to predict the ratings of these users for movies in the set that they have not rated. The first team to beat the accuracy of Netflix’s proprietary algorithm by a certain margin wins a prize of $1 million!

Different student teams in my class adopted different approaches to the problem, using both published algorithms and novel ideas. Of these, the results from two of the teams illustrate a broader point. Team A came up with a very sophisticated algorithm using the Netflix data. Team B used a very simple algorithm, but they added in additional data beyond the Netflix set: information about movie genres from the Internet Movie Database (IMDB). Guess which team did better?

The article has

no responses yet

Written by Guru Kirthigavasan

April 2nd, 2008 at 7:45 am

Baseball Association Analyzes Statistics with Cognos  

Its interesting that more and more sports associations are starting to use Business Intelligence software to analyze statistics. As Ian Ayres points out in his latest book, Super Crunchers, the competition between the traditional experts and number crunching softwares has ended. And number crunching softwares are being increasingly used by tranditional “intutional” experts to analyze the data better.

These are clearly the days of Data Mining Softwares. This one is about IBM Cognos. Read more -

“Our analysis of player performance is as complex and dynamic as the work of high-powered business analysts in Fortune 500 companies, and we need to use the same robust, flexible interface to achieve reliable results,” said Doyle Pryor, Assistant General Counsel of the MLBPA. “Conducting complex analysis in real-time allows us to improve our planning processes and IBM Cognos TM1 Executive Viewer enables the agents themselves to view reports and perform almost limitless ‘what-if’ scenarios for further analysis of the data.”

“The interface for analysis will provide sophisticated users with the tools they’re familiar with and the ability to quickly modify views and reports with as little effort as possible,” said Doug Barton, vice president, product marketing, Cognos, an IBM Company. “Users of IBM Cognos TM1 Executive Viewer continue to gravitate to its features that provide interactivity, immediacy, and flexibility, which, in turn, enable them to accelerate the management of their business’s performance.”

The article has

no responses yet

Written by Guru Kirthigavasan

March 28th, 2008 at 9:38 am

Data Visualization Helps Panoratio Data Mining Users  

From the Press Release -

Panoratio, a provider of innovative technology that maps statistical content from large and complex datasets, has selected OpenViz data visualization software from Advanced Visual Systems (AVS) to be incorporated into its Data Explorer product.

Panoratio uses OpenViz to provide highly interactive and graphical displays of dense imagery in near-real time, with virtually no restrictions on the complexity or amount of data that can be analyzed.

Panoratio’s Data Explorer is a smart data analysis tool that rapidly queries Panoratio Portable Database Images and delivers results in seconds with built-in intelligence that assists analysts in finding patterns and relationships in the data which they might not otherwise discover.

According to Dr. Oliver Mihatsch, Chief Technology Officer of Panoratio, “We selected OpenViz because it was by far the most flexible data visualization system and was more-than-able to meet the real-time demands of our high performance data mining technology.”

Independent software makers such as Panoratio use OpenViz to serve as an embedded graphics platform for interactive analytics and data visualization. Designed to overcome the limitations of static charting packages, OpenViz enables application designers and product managers to create high performance solutions from extremely complex data, algorithms and integrated corporate content.

The article has

no responses yet

Written by Guru Kirthigavasan

March 27th, 2008 at 2:04 pm

Deepest Data Mining  

New York times ran a story last week about a online data mining company called Phorm. While the data that the company mnes is controversial, they are starting to be talked about in the industry. Read more about Phorm at NYT-

Amid debate over how much data companies like Google and Yahoo should gather about people who surf the Web, one new company is drawing attention — and controversy — by boasting that it will collect the most complete information of all.

The company, called Phorm, has created a tool that can track every single online action of a given consumer, based on data from that person’s Internet service provider. The trick for Phorm is to gain access to that data, and it is trying to negotiate deals with telephone and cable companies, like AT&T, Verizon and Comcast, that provide broadband service to millions.

Phorm’s pitch to these companies is that its software can give them a new stream of revenue from advertising. Using Phorm’s comprehensive views of individuals, the companies can help advertisers show different ads to people based on their interests.

The article has

no responses yet

Written by Guru Kirthigavasan

March 26th, 2008 at 8:54 am

Posted in BI Vendors,Data Mining

Tagged with , ,

Reality Mining and Surprise Modeling – Future Tech  

Reading this Technology Review, it seems inevitable that such advanced mining technologies will pop-up in the near future. The world has a wealth of information and every single thing will be data mined in the future. And what a movement that will be.

By the way, the MIT Technology Review calls Reality Mining as one of the 10 technologies that we think are most likely to change the way we live. Exciting, Ain’t it ?

Also Surprise Modeling which combines data mining and machine learning to help people do a better job of anticipating and coping with unusual events is also one of the Top 10 Technologies listed by MIT Tech Review. This is being advocated by Eric Horvitz, Microsoft Research.

From the article on Reality Mining -

Reality mining, he says, “is all about paying attention to patterns in life and using that information to help [with] things like setting privacy patterns, sharing things with people, notifying people–basically, to help you live your life.”

Within the next few years, Pentland predicts, reality mining will become more common, thanks in part to the proliferation and increasing sophistication of cell phones. Many handheld devices now have the processing power of low-end desktop computers, and they can also collect more varied data, thanks to devices such as GPS chips that track location. And researchers such as Pentland are getting better at making sense of all that information.

To create an accurate model of a person’s social network, for example, Pentland’s team combines a phone’s call logs with information about its proximity to other people’s devices, which is continuously collected by Bluetooth sensors. With the help of factor analysis, a statistical technique commonly used in the social sciences to explain correlations among multiple variables, the team identifies patterns in the data and translates them into maps of social relationships. Such maps could be used, for instance, to accurately categorize the people in your address book as friends, family members, acquaintances, or coworkers. In turn, this information could be used to automatically establish privacy settings–for instance, allowing only your family to view your schedule. With location data added in, the phone could predict when you would be near someone in your network. In a paper published last May, ­Pentland and his group showed that cell-phone data enabled them to accurately model the social networks of about 100 MIT students and professors. They could also precisely predict where subjects would meet with members of their networks on any given day of the week.

The article has

one response

Written by Guru Kirthigavasan

February 21st, 2008 at 8:37 am

Presentation Two-Phase Arch for Mining Time Series Data  

Presentations without explanation or slide notes are always cryptic.

But while browsing Slideshare, I found this neat presentation on Extended Two Phase Architecture for data mining time series data. If you are in the field of data mining, this wouldn’t be hard to understand and its a must read too.

On the other hand, if you are a newbie at Data Mining, while there are numerous sources for Data Minin literature on the web, here is a simple, Data Mining Concepts presentation.

Also, the Microsoft version of Data Mining for Developers, introduces Data Mining concepts determining the problems between traditional Business Intelligence and Predictive Analytics. Very Interesting !!

The article has

4 responses

Written by Guru Kirthigavasan

February 2nd, 2008 at 10:16 am