Archive for the ‘Tech’ Category

Google acquires reCAPTCHA for book scans

Monday, December 21st, 2009

Google acquires reCAPTCHA for book scans

Teaching computers to read, Google hopes to bolster Google Books and the Google News Archive

Technology trends and news by Dan Spelzmann
September 16, 2009 | Comments (0)
Short URL: http://vator.tv/n/aa8

16340

Google has acquired reCAPTCHA, the company known by most users as a provider of those (slightly annoying) tests where you have to type out the squiggly, morphed words displayed to sign in to a site. The idea is to prevent bots from buying all the tickets for a show in the first 10 seconds of the sale or signing up for every available email address.

reCAPTCHA

Google says reCAPTCHA currently guards over 100,000 Web sites from such spam attacks.

The service has much broader applications, though.

reCAPTCHA is aiding the massive task of digitizing books, newspapers and old time radio shows. For physical books, it’s a two-step process: scan a page, then transform into text using “Optical Character Recognition” (OCR).

Unfortunately, even the most sophisticated OCR program cannot easily transcribe just any scanned image of a page of text, for example, because in some older books, either time has taken its toll on the paper and ink or the font is just plain weird. But humans can probably figure out what it means.

fail

According to reCAPTCHA’s Web site:

About 200 million CAPTCHAs are solved by humans around the world every day. In each case, roughly ten seconds of human time are being spent. Individually, that's not a lot of time, but in aggregate these little puzzles consume more than 150,000 hours of work each day.

reCAPTCHA gives users two words. The first is a word reCAPTCHA knows. The second is the word from that ancient or damaged text that the computer is trying to transcribe. If a user gets the first word right, then reCAPTCHA assumes it’s dealing with a human, and accepts the user’s input for the second word. After many run-throughs with many different users, reCAPTCHA pools all the inputs for the second word and assumes the majority answer is probably what the word actually is.

In this way, reCAPTCHA can continually utilize the crowd to correct and improve its OCR.

Google’s acquisition of the company makes a lot of sense, considering that they are currently invested in two large-scale digitization projects: Google Books and the Google News Archive.

Bing plays catch up

Tuesday, November 24th, 2009

Google Loses Ground to Bing with Small Biz Search Advertisers

New data from WebVisible indicate that small businesses are diversifying their paid search campaigns to include sites not named Google. The local interactive advertising firm's report found that 60.4 percent of search spending went to Google last quarter, while Yahoo reaped 26.2 percent, Bing garnered 10.5 percent, and Ask.com pulled in 2.4 percent.

Google lost 5 percent share compared to Q3 last year, according to the study that surveyed around 25,000 companies with less than 200 employees during the last four fiscal quarters. Kevin Ryan, CMO for the Irvine, CA-based WebVisible, said the most-interesting finding was how Microsoft's aggressive promotions for its new kid on the block, Bing, have paid off, gaining market share from Google.

“No one expected this to happen when they came into the game just six months ago,” he explained. “I'm sure people are wondering if Bing's success will last beyond its huge media buy. Once Bing stops spending that kind of money to get in front of local advertisers, [small businesses] may forget about [the search site]. That is, unless it is as effective as it claims to be.”

While click-through rates were up year-over-year for all the engines, Yahoo charted the biggest improvement with a 123 percent CTR increase. Bing's CTRs lifted by 76 percent, while Google experienced a 52 percent hike. Google's cost-per-click average was 14 percent higher compared to Q3 last year and was 30 percent more expensive than Yahoo and Bing for the most-recent quarter.

Small Businesses Increase Spend by 91 Percent

Generally, small businesses are buying more keywords and dramatically increasing their paid search budgets when compared to last year, according to WebVisible. First off, the average small business purchased 55 keyword phrases in Q3, which is up 30 percent from Q3 2008's median number of 43. That statistic represents the high-water mark for the four quarters that WebVisible has been tracking small businesses keyword buys.

Meanwhile, businesses dedicated an average of $1,658 to search ads, 91 percent more than Q3 2008. And business-to-consumer professional services appear to be the busiest in terms of collecting local sales leads via SEM. Attorneys and dentists made up the top two advertiser categories, with 7.7 percent and 5 percent of total small advertisers, respectively. Each of the two categories invested far more than average, spending $2,560 and $2,005 respectively in Q3.

Air conditioning services and physicians/surgeons were the only other categories that accounted for more than 2 percent of search advertisers. Overall, the research suggests that the small business search advertisers are a varied bunch. The top 20 categories accounted for only 36 percent of total dollars spent.

Thirty-two percent of search clicks resulted in a “lead conversion,” meaning the viewer either clicked through to a landing page on the advertiser's Web site, printed a landing page, watched a video, printed out directions, entered an e-mail address, inquired via e-mail, or completed an online form. Clicks to the Web site were far and away the biggest lead conversion type, coming in almost twice as high as the next three categories: printed landing pages, submitted e-mail inquiries, and printed driving directions.

For small businesses utilizing a call tracking number, 4.5 percent of the clicks resulted in a call, a 3.6 percent lift from 2008. No material differences occurred among advertisers in terms of CTR or proportion of lead conversions. However, WebVisible said that cost-per-clicks and keyword counts tended to increase with rising spend levels.

Email archive setups

Wednesday, October 28th, 2009

Suspect in Danish terror plot staying in jail for now – Chicago <b>…</b>

Chicago Breaking <b>News</b>. Powered By. in Chicago … Share this story: Twitter; Facebook; Email. Breaking and recently updated stories from the Breaking <b>News</b> Center: … MUGS IN THE <b>NEWS</b>. Death by stabbing · 334 marijuana plants …

A <b>New</b> Financial Model For <b>News</b>, Straight From The Cable Industry <b>…</b>

The challenge facing the <b>news</b> business today is now so worked-over that any edges have become smooth. The well-worn dilemma: more people than ever are consuming <b>news</b> and information, but the free internet distribution driving that …

<b>News</b> Ticker: The Killers, Morrissey, Lilith Fair, Jay-Z : Rolling <b>…</b>

Up to the minute breaking <b>news</b> from the world of music, from the editors of Rolling Stone.

Email Archive

museum_email_archive-53043-1

I find I would rather have my own email archive. I do it already and it makes me more comfortable to be in charge of my emails. If I am having another take charge of my email archiving, how am I to know that they will not use what I have said or others have said to me against me? I just think in terms of dealing with the archive, I would prefer to have it accessible to me as I see fit. It is my private property; it is my thoughts and responses to my thoughts. It is mine.

Some might argue that creating an email archive on my own is tantamount to setting me up for a loss. Hard drives do fail. Programs change and connections can be lost or degraded. I am willing to take this kind of risk seeing that ultimately these are my own risks. I feel that relying on another to be able access, protect or even find something I need puts me at a disadvantage.

Still, many would prefer not to have to deal with the whole idea of keeping any kind of an archive let alone an entire mailing archive. They find the programs confusing or darn right too hard to deal with. They are afraid of doing the wrong thing, pushing the wrong button or losing where the archive is being stored. Some times they lose the program that they use to create the email hosting setup and all they have is something that is not able to be opened. That presents a quandary that is just a frustration. Most people are frustrated enough without having another program create a frustration for them.

Even with these negatives, taking the time to learn to do my own technical archiving makes me in control of my correspondence. I prefer to have the control that doing my own archive presents to me. I do not have to worry about someone else looking through my old emails and reading my private correspondence. I can take pride in learning a new skill. I have the satisfaction of finding what I want when I want it. I do not have to be at the mercy of some outside company and that company’s handlers as to when I have access to my own private correspondence. I definitely do not find it a negative to be in control of my own emails.

My significant reason to control and create my own email setup is security. I am not egotistical in the sense that I have great or damning information in my emails that I need to be in total control of my business. Nor do I feel that I have such great prose in my emails that someone would be looking to plagiarize what I or my correspondences have written to one another. Nor do I feel that I have any particular hidden jewels or treasures in my email archive. What I do feel is since all of this was sent to me or me to others, it is mine and I have the freedom to keep what is mine unless there is any compelling reason to let another or anyone have access, even the original authors of the original emails (not that I would stop them from seeing some writing of theirs I have in my possession).

Yahoo Mail Suffers Outage

Monday, October 26th, 2009
Yahoo Mail Suffers Outage

It appears that Yahoo Mail is suffering from an outage. Complaints from Yahoo Mail users are all over Twitter. It’s unclear how extensive the outage is and what is causing the issue but we’ll report back when we find out.

Yahoo Mail is currently the No. 1 Web mail service with 300 million people using it worldwide, so even a smaller outage could result in a large amount of people not having access to their email accounts. Competitor Gmail also succumbed to a serious outage recently, leaving users with nearly eight hours of downtime.

Update: It appears that the outage isn’t system-wide but users are still complaining of the service being down. Coincidentally, Yahoo is shutting down Geocities today so perhaps they pulled the wrong plug?