SAN JOSE, Calif. — Google isn’t only crawling your Web site, it’s looking at your source code as well. That was the introduction for Chris DiBona, the open source programs manager at Google (NASDAQ: GOOG), one of several keynote speakers here at OSCON 2009 today.
While Google Code Search has been available through Google Labs for some time, DiBona revealed some telling — and potentially — unexpected findings about the state of open source.
He said Google’s found some 2.5 billion lines of open source code spread across 30 million unique files that the search giant’s identified via its code search crawl. He then played a little quiz game with the audience, asking a series of questions about Google’s discoveries.
For example: “Is there more C or C++ open source programs?” Answer: More than twice as many are based on C.
More Perl or PHP? More PHP, by more than 37 million lines of code.
Smalltalk versus Objective C? Smalltalk wins by nearly three times as much.
Troff or Ruby? Troff by more than 88 million lines of code.
DiBona admitted there’s probably a certain amount of error in the calculations because of project duplication and other factors, but joked “we make it up in volume.”
Microsoft has been making big moves into open source lately, so its appearance at OSCON wasn’t a surprise. Tony Hey, the software giant’s corporate vice president of external research, made a pitch for how open source is currently helping the scientific community, and what it can do in the future.
“I believe there is a great opportunity for Microsoft and other companies to help scientists solve problems,” he said. “Science has to move from data to information to knowledge.”
He noted that with the increase in sensor networks and myriad other sources, scientists can easily be overwhelmed by the flood of data they have access to. That makes it important that scientists are able to use the best tools for the job, whether they are proprietary or open source-based.
“We’re trying to give people choice and that means Microsoft, open source, Google, Oracle and IBM,” Hey said. “We want to help scientists spend less time on IT issues and more on discovery.
For instance, he mentioned a Microsoft project called PhloD, a statistical tool used to analyze the DNA from large pools of HIV patients. The idea is to try to find correlations between how the HIV virus changes.
Hey said Microsoft offers PhloD as an Azure service, part of its cloud computing initiative. “Scientists can upload data to the cloud and do their analysis without intervention,” he said.
He also noted that Microsoft now offers a number of open source extensions to Office and some of its other products that make them more accessible.
Article courtesy of InternetNews.com.
Huawei’s AI Update: Things Are Moving Faster Than We Think
FEATURE | By Rob Enderle,
December 04, 2020
Keeping Machine Learning Algorithms Honest in the ‘Ethics-First’ Era
ARTIFICIAL INTELLIGENCE | By Guest Author,
November 18, 2020
Key Trends in Chatbots and RPA
FEATURE | By Guest Author,
November 10, 2020
FEATURE | By Samuel Greengard,
November 05, 2020
ARTIFICIAL INTELLIGENCE | By Guest Author,
November 02, 2020
How Intel’s Work With Autonomous Cars Could Redefine General Purpose AI
ARTIFICIAL INTELLIGENCE | By Rob Enderle,
October 29, 2020
Dell Technologies World: Weaving Together Human And Machine Interaction For AI And Robotics
ARTIFICIAL INTELLIGENCE | By Rob Enderle,
October 23, 2020
The Super Moderator, or How IBM Project Debater Could Save Social Media
FEATURE | By Rob Enderle,
October 16, 2020
FEATURE | By Cynthia Harvey,
October 07, 2020
ARTIFICIAL INTELLIGENCE | By Guest Author,
October 05, 2020
CIOs Discuss the Promise of AI and Data Science
FEATURE | By Guest Author,
September 25, 2020
Microsoft Is Building An AI Product That Could Predict The Future
FEATURE | By Rob Enderle,
September 25, 2020
Top 10 Machine Learning Companies 2020
FEATURE | By Cynthia Harvey,
September 22, 2020
NVIDIA and ARM: Massively Changing The AI Landscape
ARTIFICIAL INTELLIGENCE | By Rob Enderle,
September 18, 2020
Continuous Intelligence: Expert Discussion [Video and Podcast]
ARTIFICIAL INTELLIGENCE | By James Maguire,
September 14, 2020
Artificial Intelligence: Governance and Ethics [Video]
ARTIFICIAL INTELLIGENCE | By James Maguire,
September 13, 2020
IBM Watson At The US Open: Showcasing The Power Of A Mature Enterprise-Class AI
FEATURE | By Rob Enderle,
September 11, 2020
Artificial Intelligence: Perception vs. Reality
FEATURE | By James Maguire,
September 09, 2020
Anticipating The Coming Wave Of AI Enhanced PCs
FEATURE | By Rob Enderle,
September 05, 2020
The Critical Nature Of IBM’s NLP (Natural Language Processing) Effort
ARTIFICIAL INTELLIGENCE | By Rob Enderle,
August 14, 2020
Datamation is the leading industry resource for B2B data professionals and technology buyers. Datamation's focus is on providing insight into the latest trends and innovation in AI, data security, big data, and more, along with in-depth product recommendations and comparisons. More than 1.7M users gain insight and guidance from Datamation every year.
Advertise with TechnologyAdvice on Datamation and our other data and technology-focused platforms.
Advertise with Us
Property of TechnologyAdvice.
© 2025 TechnologyAdvice. All Rights Reserved
Advertiser Disclosure: Some of the products that appear on this
site are from companies from which TechnologyAdvice receives
compensation. This compensation may impact how and where products
appear on this site including, for example, the order in which
they appear. TechnologyAdvice does not include all companies
or all types of products available in the marketplace.