Google Patents Indexing, Retrieval of Blogs

This week, Google Inc. was assigned a patent by the United States Patent and Trademark Office USPTO for the invention of a system and method for indexing and retrieval of blogs.

You can find details of this new patent 7765209 in IP.com's Intellectual Property Library, which includes detailed information for this patent, first applied for in 2005 and granted in 2010 for what appears to be the core patent for Google's system and method of indexing blogs, which are now included in the results of a Google search.

The patent is abstracted as follows: "A system may receive a feed associated with a blog. The system may extract information from the feed and the blog and create a hybrid document based on the extracted information. The system may further use the hybrid document to determine a relevance of the blog to a search query."

 


BACKGROUND OF THE INVENTION

The World Wide Web (“web”) contains a vast amount of information. Locating a desired portion of the information, however, can be challenging. This problem is compounded because the amount of information on the web and the number of new users inexperienced at web searching are growing rapidly.

Search engines attempt to return hyperlinks to web pages in which a user is interested. Generally, search engines base their determination of the user's interest on search terms (called a search query) entered by the user. The goal of the search engine is to provide links to high quality, relevant results (e.g., web pages) to the user based on the search query. Typically, the search engine accomplishes this by matching the terms in the search query to a corpus of pre-stored web pages. Web pages that contain the user's search terms are identified as search results and are returned to the user as links.

Over the past few years, a new medium, called a blog, has appeared on the web. Blogs (short for web logs) are publications of personal thoughts that are typically updated frequently with new journal entries, called posts. The content and quality of blogs and their posts can vary greatly depending on the purpose of the authors of the blogs. As blogging becomes more popular, the ability to provide quality blog search results becomes more important. [emphasis added]
 

SUMMARY OF THE INVENTION

In accordance with one implementation consistent with the principles of the invention, a method may include receiving a feed; fetching a blog and one or more posts associated with the feed; extracting information from the feed, the blog, and one or more posts; creating a hybrid document based on the extracted information; and using the hybrid document to determine a relevance of the blog or the one or more posts to a search query.

In another implementation consistent with the principles of the invention, a device includes a memory to store instructions and a processor. The processor executes the instructions to receive a search query, determine a relevance of a blog or a blog post to the search query based on information extracted from the blog or blog post and information extracted from at least one other source, and provide information relating to the blog or the blog post when the blog or the blog post is determined to be relevant to the search query.

In yet another implementation consistent with the principles of the invention, a method may include receiving a search query; determining a relevance of a first set of documents to the search query using a second set of documents, where the first set of documents includes blogs and blog posts and the second set of documents includes hybrid documents created from the first set of documents and at least one other source; and providing information regarding documents in the first set of documents determined to be relevant.

In still another implementation consistent with the principles of the invention, a method may include receiving a search query; identifying a first set of documents to provide in response to the search query based on a second set of documents; and providing information relating to the identified first set of documents.

In yet still another implementation consistent with the principles of the invention, a method may include receiving feeds associated with blogs, extracting first information from the feeds, extracting second information from the blogs and associated posts, creating hybrid documents based on the first information and the second information, receiving a search query, determining a relevance of the blogs or posts to the search query based on the hybrid documents, and providing information relating to the blog or posts determined to be relevant.

Click here for details including the specific claims of Google's patent for indexing and retrieving blog posts, which have been included in Google search results for some time now. The patent description concludes with this caveat:

It will be apparent to one of ordinary skill in the art that aspects of the invention, as described above, may be implemented in many different forms of software, firmware, and hardware in the implementations illustrated in the figures. The actual software code or specialized control hardware used to implement aspects consistent with the principles of the invention is not limiting of the invention. Thus, the operation and behavior of the aspects were described without reference to the specific software code—it being understood that one of ordinary skill in the art would be able to design software and control hardware to implement the aspects based on the description herein.

 

Jonathan Schwartz, his new blog -- book?

Jonathan Schwartz, former CEO of Sun Microsystems, has long been an advocate for transparency in the world of business and in the blogosphere, being one of the first Chief Executive Officers of a Fortune 500 company to blog . On February 4, 2010, following the acquisition of Sun Microsystems by Oracle, Schwartz resigned from his post as CEO of Sun. His resignation was a haiku on Twitter that read as follows: "Financial crisis/Stalled too many customers/CEO no more." Now, Jonathan Schwartz has a new blog interestingly titled What I Couldn't Say...

It’s a lot harder writing a blog as an individual than as a Chief Executive.

As a CEO, you have an obvious and explicit agenda in nearly all communications – drive awareness for the company, its products and ideas. The same applies across internal and external audiences (the notion you can separate audiences is comically antiquated – ubiquitous social media renders listeners just as powerful as speakers).

As an individual, my agenda isn’t nearly so clear (at least one reason it’s taken me so long to post a first entry). On the one hand, I’d like to put context around some of the decisions I faced at Sun. There was almost always more going on behind the scenes than was obvious to the outside world, and some of that backdrop might be interesting.

Could probably fill a book.

Cartoon by Hugh MacLeod @gapingvoid on Twitter and gapingvoid.com

Tweet of the Week @CiscoSystems

John Earnhardt noted on The Platform, the blog of Opinions and Insights from Cisco, that this is the 5th anniversary of Cisco's first blog post. This auspicious occasion was marked, today, with this tweet.

Click on the image above to read the entire post, but here's an excerpt.

This first blog entry started something of a trend at Cisco. We now have internal blogs “out the wazoo” (I believe is the technical term). And, we now have 16 “corporate blogs” that you can access on Blogs.Cisco.com. Topics range from High Tech Policy (our first blog), to corporate stuff (this one), to mobility, to collaboration, to Service Provider, to Security, etc.

Over the past five years, we have grown our social media presence, we have added a Cisco YouTube channel for the all-important medium of video, we have added a corporate @CiscoSystems Twitter account (actually MANY Cisco Twitter accounts), we have an active Cisco Facebook account and we are experimenting with other social media like Flickr, UStream and more. The net is that interacting with customers, investors, employees, partners and more is important to us. We want to be accessible. We want to talk….and, more importantly, we want to listen.

Congratulations, Cisco. We're following you.

ABA Journal Blawg 100 Law Blogs

Congratulations to Dennis Crouch at Patently-O and Gene Quinn at IP Watchdog who were recognized in the third annual ABA Journal Blawg 100, the best legal blogs as selected by the Journal’s editors with a little help from their readers.

You're invited to go here and vote for your favorites.

In addition, these Intellectual Property law bloggers should garner a good number of "write in" votes:

They're all winners as far as we're concerned.

USPTO Director David Kappos Blogs

According to the Just a Patent Examiner Blog, USPTO Director David Kappos has started a blog. For the time being, it's only hosted on the internal USPTO servers. However, according to the blog, "we plan to make the blog available to the public in the coming weeks."

Once the blog is made public, hopefully, the United States Patent and Trademark Office will reach out to its followers via Twitter, as well, @USPTO thanks to the foresight of @jmattbuchanan at Promote the Progress, who continues to hold that username for the USPTO to utilize as soon as the leadership is ready to take full advantage of this new communication channel as has the President and so many other government departments and agencies.

 

Guest Blogging on IP.com's Blog

Guest blogger R. David Donoghue, author of the Chicago IP Litigation Blog and a partner of Holland & Knight, is featured on IP.com's corporate weblog, Securing Innovation, where we've published his August Carnival of Trust, a traveling review of the last month's best posts related to various aspects of trust in the business world.

Dave Donoghue is one of a cadre of leading intellectual property law bloggers that have signed on recently to guest blog on the top patent blog, Patently-O. Also guest blogging this month, on Dennis Crouch's Patently-O, are many of the other top patent bloggers: Kevin Noonan of Patent Docs, Brett Trout of BlawgIT, Stephen Albainy-Jenei of Patent Baristas, and Joff Wild, Editor of IAM Magazine and the IAM Blog. Dennis has even got intellectual property blogger emeritus Jeremy Phillips grinning like a Cheshire cat at the prospect of writing an article for the top patent blog.

It's great to see this worldwide collaboration among the top patent blogs; bloggers who have learned the art of guest blogging and are showing others how it's done.

Here on Securing Innovation, IP.com is committed to a collaborative IP blog that is an integral part of the community of intellectual property professionals, attorneys, patent agents, examiners, strategists, and advisors. It's with a spirit of collaboration and mutual support that we've designed our corporate blog to feature a Guest Blogger with a prominent placement, rather than following the standard practice of including the guest blogger among the regular posts. This design gives the Guest Blogger a higher profile for a longer period than is usually the case on other blogs that publish guest authors. Contributed articles on Securing Innovation are also included in the permanent Guest Blogger Archives with a distinctive RSS feed for these featured articles. Looking at the lineup of featured guest bloggers, it makes a lot of sense to subscribe to this Guest Blog and add it to your favorite feed reader.

So far, we've featured the following Guest Blogger articles on Securing Innovation:

Remember Mr. Murphy When it Comes to Protecting Intellectual Property by Jason Shinn

Saving Your Intellectual Property by John Avellanet

Are Patents A Driver of Innovation or Just a Tax? by Stephen Albainy-Jenei

Should Management Be Involved In Patenting Decisions? by Clifford D. Hyra

Vote for the Top Patent Blogs by Gene Quinn

Technology Transfer Office Uses Workflow Software to Boost IP Management, Marketing by the Editor of Technology Transfer Tactics

August Carnival of Trust by R. David Donoghue

We're open to suggestions from bloggers who would like to be featured as a Guest Blogger here on Securing Innovation, the corporate blog of IP.com While we're not Patently-O and can't boast the traffic of, say, Patent Baristas, we are always keen to collaborate with IP Bloggers who have something interesting they'd like to share with readers of Securing Innovation and clients of IP.com who follow our corporate blog. We expect that several of the executives of our clients and customers, some of whom don't even have blogs, will want to publish an article in this forum of intellectual property professionals.

Got an idea for a post that you'd like us to publish or cross-post on Securing Innovation? Send an email to blog@ip.com and we'll be happy to discuss it with you. Nice people, we love cats and dogs.

Anatomy of a Company Blog

Clients of IP.com and readers of our company blog, Securing Innovation, are tech-savvy. So, it's not surprising that most of this blog's followers read the posts via the RSS feeds (really simple syndication) by which we deliver content automatically to subscribers.

Sometimes, though, we wonder if those who subscribe to one or more of these RSS feeds aren't missing the big picture.

And, by that, we don't mean just the professional design of our company blog, created for us by the talented team of legal and corporate communications specialists at LexBlog. We're talking about the total picture, the way the company blog organizes content and presents an organic arrangement of information, each segment of the blog interacting with and complementing the other.

In additon to the main posts of this company blog, where we write about managing intellectual property, patents, trademarks, and trade secrets, we've added a sidebar column of "Quick Links" that is sort of a "clipping service" where we link to blog posts and news articles that are of interest to us at IP.com and, perhaps, to our readers. Some subscribers get our Quick Links in a separate RSS feed from this blog. Others might not have noticed it's a separate feed.

Also in the sidebars, we've collected links to several IP Resources as well as to many of the leading intellectual property blogs, tech blogs, and business blogs. This is the community of intellectual property professionals with which we at IP.com are engaged, not only online but in real-world interactions, email, telephone discussions, and face-to-face meetings. Among the many intellectual property resources in the sidebars of this blog, you'll find links to the top patent blogs, and more.

Notably, we've added a "Guest Blogger" column, prominently featured in the top left of the front page, where the contributions of other specialists and leading practitioners of intellectual property law and policy present viewpoints beyond our own perspectives.

What Is IP.com, and why read our blog?

When this blog began, Tom Colson, CEO of IP.com, introduced the company to the blogosphere and discussed why we believe in business blogs.

In our little corner of the online world, there's a lot happening with patents, trademarks, and trade secrets, and a lot of relevant stuff is being said on interesting blogs by people who really know what they're talking about. So we're joining the conversation, and blogging about how innovation is managed by corporations with a vested interest in their Intellectual Property.

Find the conversation. Join it. Contribute to it. "Conversing is how we learn. It's how we network. It's how we grow as professionals," says Kevin O'Keefe, CEO of LexBlog, whose team of experts guided us in the development of our corporate blog, "Blogging is a conversation. Not only do you learn and grow your reputation by joining in, you will not be conspicuous by your absence."

Since adding social networking tools to our corporate communications, we've worked continuously to improve the layout and design of the company blog to better engage in these conversations, adding new features as we grow.

Convergent Media

Where once the tools of corporate communications were press releases and newsletters, corporations today have continuous conversations with their clients, customers, and stakeholders. A press release or, better yet, an article based on the news in the press release, is posted on the corporation's own media channel, the company blog. Corporate executives, clients and customers read about it on the blog, or in their feed readers, and tweet about the news on Twitter, visiting the company blog to comment on the article.

 Click on the graphic link above to see how the CEO of Zappos recently announced the sale of the company to Amazon, in a letter to the employees posted on the company blog and to the followers of @zappos on Twitter, all 1,030,194 of them. Now, we're not a startup shoe seller, but there's lots we can learn from them. In the Twitter stream in the sidebar of our company blog and on Twitter @ipdotcom is where we put what we've learned about convergent media into practice.

If you've got a favorite RSS feed reader, be sure to add all our blog's feeds to those you're following. But don't then forget to stop by the blog, as well, to see what else is going on here. It's a work in progress, and we welcome your comments to make it more interesting and informative.

Please let us know in the comments below how we can make this company blog a helpful place on the internet for you to get the latest information about intellectual property. Did I mention that we're linked to IP Newflash from the company blog? Oh, you saw that. Wonderful.

IBM Bid for Sun: Jonathan Schwartz's Blog

IBM is reportedly in talks to buy Sun Microsystems for $6.5 billion or more. We'll leave it to others to comment whether any proposed deal makes sense or makes no sense.

In any event, it's interesting to observe how masterfully Sun CEO Jonathan Schwartz used his corporate blog in the weeks ahead of this breaking news, in his words, "to set out a clear direction of where Sun's headed."

In a series of posts that coincide with news of a possible acquisition by IBM, Sun's CEO Jonathan Schwartz combined the use of personal video embedded into his blog with transcripts of his detailed description of Sun's strategic imperatives:

1. Technology Adoption
2. Commercial Innovation
3. Efficiently Connecting 1. and 2.

Jonathan Schwartz sets up the series of three substantive posts with these words:

As you know, simplicity takes a lot of engineering, so it's easy to say "just three things," but I'm not in any way suggesting these tasks are easily accomplished. But our intent is to create, promote, and commercialize the highest quality network innovations. Innovations that captivate developers, and deployers.

To understand Sun, you have to understand both, you have to see what drives our financial performance, as well as read our financial statements. Absent both perspectives, you'll miss the bigger picture, the bigger threat, or the bigger opportunity.

Simply brilliant corporate communications.

Might the real "plum" in this acquisition be Jonathan's Blog?

Check this out:

Speculation is rife that IBM is closing in on a $6.5bn deal to acquire Jonathan Schwartz’s blog. If it happens it will become the largest acquisition in the blogosphere to date, and lend weight to the argument that Sun Microsystems’ CEO was right to spend so much of his time blogging...

What are the top patent blogs?

US Patent Attorney Eugene R. Quinn, Jr., president and founder of IPWatchdog, Inc., has put together a list of the top patent blogs as ranked by Technorati, one of the leading blog search engines and a recognized source of information about popular blogs and those who link to them.

Patently-O

Patent Baristas

IPWatchdog 

Against Monopoly 

Patently Silly 

Chicago IP Litigation Blog 

PHOSITA 

Spicy IP 

PLI Patent Practice Center 

Duncan Bucknell Company’s IP Think Tank 

Patent Prospector 

Securing Innovation 

Peter Zura’s 271 Patent Blog 

The Invent Blog

Promote the Progress 

I/P Updates

IP NewsFlash 

Orange Book Blog 

The IP Factor 

Philip Brooks' Patent Infringement Updates

Patent Docs 

Anticipate This! 

Patent Fools (now operated by IPWatchdog.com) 

Patentably Defined 

Steve van Dulken’s Patent Blog 

IP Spotlight

To discover how Gene Quinn determined these top blogs from Technorati's rankings of the top million blogs, read the IP Watchdog post here. And for more IP Blogs, including patent blogs too new to have risen among the ranks of these more established blogs, please check out the many others linked in the sidebar in the left column of our blog, Securing Innovation.

Obviously, we're delighted to find our intellectual property blog in such fine company. Thanks to everyone who has kindly linked our blog and introduced us to their readership. We appreciate that our blog has risen rather quickly in its first year in the blogosphere, and to unexpected heights, on the shoulders of giants. Thank you.

Turning One and Blogrolling Right Along

It's been about a year now since we started our blog, Securing Innovation, and were given this warm welcome by Victoria Pynchon at The IP ADR Blog. Since then, our readership has grown remarkably, and many have subscribed to this blog's feeds.

Securing Innovation also got a lot of link love when we hosted Blawg Review #179, which highlighted the best of the intellectual property blogosphere while commemorating the invention of the ballpoint pen. We're especially grateful for the generous comments from Victoria Pynchon regarding our relatively new blog.

If we have achieved such heights, it is by standing on the shoulders of giants, and we'd like to acknowledge the support of those bloggers in the intellectual property community who have kindly added a link to Securing Innovation in the blogrolls of their well-regarded blogs.

Without them and many others who have, from time to time, shared our posts with their readers, we could not have had such a successful first year of blogging. It's really been more than we expected.

We've learned a lot and we're continuing to pick things up as we go. And we're adding new features to make this blog even more useful, to us and our readers. More about that in the weeks ahead. In the meantime, we hope you enjoy as much as we do the many useful IP Resources, Tech Blogs, Business Blogs, and IP Blogs that are already linked in the sidebar blogrolls here on Securing Innovation. If you think of others we should link, please let us know in the comments below. Thanks for all your help.