Skip to content

Search Engine Feature Request: Partial Pages

I wish there was a way to tell the search engines to only index the “content” area of a page. For example, on a blog, only the text from the title through the last comment should be indexed. Everything else, site navigation, ten most recent post titles, ten most popular text titles, footer text, should not be indexed. On my theater site, the right sidebar has a list of the shows that are coming up the next week. But when I search the site with Google, and happen to search for a show that is currently playing, every page on the site comes up. This isn’t useful.

One of the web sites I used to manage used a Perl-based search engine, Fluid Dynamics Search Engine. You could put comments in your site templates so that it wouldn’t index sections of a page. This worked very well. Their FAQ has an entry on this: How to prevent sections of your pages from being indexed. I want this functionality in the big search engines as well.

Upate: 2019-05-21: Links removed, as the site that offered FDSE appears to be dead.

Incorporated Twitter Into One of My Blogs

One of my other blogs has several sections in it that are not powered by WordPress, but my wife and I do a lot of work in those sections. But that work is behind the scenes, which means it isn’t immediately obvious that we’ve done anything to the site. So I created a twitter account just for that site, and installed their widget into the sidebar. We’ll see how well that works, but I think I am already pretty happy with the results. The site is ShowBizRadio.net, the audition and schedule listings are the largest sections that aren’t run by WP.

By the way, my twitter account is planetmike

The New Bitacle: EnjoyCHN.com

I accidentally have discovered a company doing exactly what Bitacle did in mid-to-late 2006. It is scraping people’s RSS feeds from their blogs and mirroring the content in its entirety. Then they wrap lots of Google Adsense ads around it. So they are making money off of my work, and the work of others. The site is web.EnjoyChn.com. No link, they don’t deserve one.

From browsing their site, it looks like there are several hundred sites being pillaged this way. If I have time later this week, I may try to contact some of those people.

After the Bitacle debacle last year, I added the “Angsuman’s Feed Copyrighter with Bitacle blocker” WordPress plugin to my site. It adds a copyright message to the footer of each post, which is dutifully reproduced on the violating web site. I tweaked the script to also include the IP address and the date/time stamp when my feed was retrieved. That information is also reproduced faithfully on the thief’s web site.

I went through all 41 of my pages, and found that until September 11, 2007 at 16:12:12, the thief was using IP address 58.31.119.82, which is assigned to a Chinese ISP. Since that date, all the requests have come from 69.93.255.82, which resolves to HostGator.com. The infringing web site resolves to 69.93.255.83, also at HostGator.

I have filed a complaint with (1) Google, to get the offender out of the Google index, (2) Google Adsense, to hopefully get them out of the Adsense program; and (3) HostGator, the web host involved in this. We’ll see what happens next.

Useful Resources:

Advertising Removed From My Site

After last week’s PageRank update, plus because the ads were making the site look very cluttered and busy, I have decided to remove all advertising from PlanetMike.com.

I’ve been experimenting with advertising on the site since leaving my full-time webmaster gig in February, 10 months ago. I have used Google Adsense, Adbrite, Project Wonderful, and Text-Link-Ads. I did receive some money for these ads, enough that it was a bit more than chump change, but not enough that I could retire completely. Each ad system had their advantages and disadvantages:

  • Project Wonderful: Easy to implement; very low payout unless you have a very popular site. (See Thoughts on the Project Wonderful ad system for more info.)
  • Google Adsense: Easy to implement; low payout.
  • Adbrite: Easy to implement; low payout; R and near-X rated ads, one ad was trying to infect Windows PC’s, which was when I dropped the program. Adbrite won’t give me my $4.94 that has accumulated in my account .:(
  • Text-Link-Ads: can be tricky to implement, requires you to give special file permissions to a settings file on the web site; ads will affect your rankings in search engines; some ads are links to bad places; very high payout.

So how am I going to make money from PlanetMike since I will no longer carry advertising? For the short term, I’ll beg for donations. Longer term, I’ll ramp up my programming and consulting efforts, which will probably include reviewing sites. So, if you need some help on a web issue, feel free to contact me.

I also will be looking into what search engine I should use so that the site can be searched. Right now I use Google’s, so I do make a few cents from those searches. But those few pennies aren’t enough to justify not looking around to see what the options are. I’m open to suggestions, just leave a comment here.

(Updated on 2018-04-27: Removed two dead links, and fixed two internal links that were pointing to the wrong pages on my site.)

Contextual Rollover Ads Suck

I’m sure you’ve seen them, those double underlined links from Kontera ContentLink, Vibrant Media, Snap. You rollover the link with your mouse, even accidentally, and you get small pop-up window showing you some advertising. I just found a site who has changed the link to a more regular looking single underline. And the word that was chosen as a link was the perfect word to be a helpful link to information at Apple.com. So I put my mouse on the link to click it, and a pop-up ad appeared helpfully suggesting I go to Apple.com. It was a Vibrant Media ad. I looked and there is no way to opt-out of seeing those ads. At least with Kontera ContentLink, I can opt-out, on a site by site basis.

You don’t think that the publisher doesn’t know those ads are annoying? Check out LinuxQuestions.org. They added ContentLink to their site, and are using it as a reason that you should register with their site. Quote: “Note that registered members see fewer ads, and ContentLink is completely disabled for all logged in members.”

Please reconsider using this type of advertising. It’s intrusive, distracting, and frustrating when things happen on-screen when you aren’t expecting it. And if you do that type of thing, you should never change the default formatting. We web users have had almost 15 years to learn that an underlined text is a link. Don’t change that expectation, it will hurt everyone in the long run.

Blogosphere News is in Beta

Yet another Digg-style, social networking site has been released, Blogosphere News as a beta. It’s under the management of Splashpress Media. It’s a good thing it’s in beta, it is still very unpolished. Dead links to the advertise page. And Mermberships (sic) are free. No contact form to reach a webmaster or designer or intern to let them know of problems. And no privacy policy. But it will be nice to have one site to get a lot of info about blogging.

Thoughts on the Project Wonderful ad system

I first stumbled across Project Wonderful in December 2006. Project Wonderful is a system that allows ads to be bought and sold as an auction, highest bidder gets the ad, and it changes in real time, so you can have an ad on a site in minutes. It started out as a way for publishers of web comics to be able to easily add advertising to their web sites, but any site can take advantage of the system.

I’ve been running some advertising on a couple of my sites (an “adbox”) the past few months, and have also run some advertising (“bids” or “campaigns”) on other sites. I allowed ads to run both here on PlanetMike, and on my Christmas site. Here are my thoughts about the Project Wonderful (PW) ad system.

For all users:

  • I wish I could stay logged into the system. The current logout time is something like 30 minutes, which is annoying. Yes, it can be dangerous, but if you aren’t on a trusted computer, you shouldn’t check a preference box to stay logged in. If someone steals my home computer, I have other problems anyways.
  • The Project Wonderful web site itself is sluggish at times. Logging in takes 15 to 30 seconds occasionally.

For publishers (sites letting others advertise on them):

  • Bids shouldn’t be able to be removed or cancelled by the bidder quickly. What happens is they bid some amount and then figure out that someone else has a much higher limit. So the bid rate goes up to $.40 but two hours later drops back to 2 cents because the incentive for the higher bid has gone away. A bid should have to remain in place for at least 24 hours.
  • The web publisher may set a minimum bid. But the ad says “your ad here,” which is a waste. Let the publisher put their own ad there, linking to wherever, at no cost. But still let someone bid more than my minimum to get the ad themselves. Right now I feel that space is wasted while I’m waiting for minimum bids. The example is the banner ad at my Christmas music site. Update (2007-10-31 11am): Ryan from ProjectWonderful pointed out to me that this capability is already in the system. I was simply misreading a certain set of options in the control panel area of the adbox definitions.
  • Pages with PW ads load slowly. On my table-based site (ugh, I know, I need to redesign) there is usually a visible delay when the page gets to the PW cell.

For advertisers (putting your ad on other sites):

  • The campaign system is wonderful. I ran three campaigns in early October to advertise ChristmasMusic247.com. The first was for only ads of any size that could be bought for a price of zero to two cents. After a week, I changed to two campaigns, one only for button ads (117×30 pixel images), the other for large ads (468×60 or 728×90 images) only. The results were amazing.
    Campaign Bid limit Total Page Views Total Click Total Expense Cost per 1000
    Page Views (CPM)
    Cost per Click (CPC)
    2 Cent Campaign
    October 1 to October 4
    $0.02 97,091
    (54,703 unique)
    353
    (343 unique)
    $4.05 $0.04
    ($0.07 unique)
    $0.01
    ($0.01 unique)
    Button Campaign
    October 10 to October 19
    $0.01 1,173,830
    (586,150 unique)
    2,434
    (2,167 unique)
    $36.18 $0.03
    ($0.06 unique)
    $0.01
    ($0.02 unique)
    Large Ad Campaign
    October 10 to October 19
    $0.06 570,720
    (254,377 unique)
    1,698
    (1,515 unique)
    $52.45 $0.09
    ($0.21 unique)
    $0.03
    ($0.03 unique)

    In those three weeks, I received nearly 4,500 clicks, for a cost of just over $100. If I had tried to run a Google Adword campaign, the cost would have been much higher for the keywords I would have had to choose in order to get high placement on pages. I just created a campaign using Adwords, and the Traffic Estimator said my keywords would need to be 20 cents (Christmas music) to 50 cents (Christmas carols), so an equivalent number of clicks would have cost me $900 to $2,250!

  • Keep in mind I was competing with other advertizers, so the sites that my ad was on changed daily (hourly!). But if I felt it was very important to be on a specifc site, I could set up a separate bid for that site, with a higher bid limit.
  • So as PW gets more popular, and daily bid rates increase, the CPM and CPC will increase. But right now, PW is an incredible bargain.

Overall, the Project Wonderful system works very well. I encourage you to check it out for your next web project. PW is very economical, easy to use, and supports small web-based businesses, but could easily be used on larger sites as well.

Should I Move All of My Site’s Content into WordPress?

Right now PlanetMike.com is made up of several different technologies: WordPress, a lot of shtml pages (server side includes), a few php pages, and lots of photo galleries created with iPhoto and Gallerie. After the recent Google PageRank update, I’m thinking about basically starting over with the entire site. I think I want to move the blog from /journal/ up to the top directory level of the site. The largest section of the site would be my jokes archive. Most (all?) of the jokes have dates on them, so I could move them into my WordPress database. That would give me a ton of blog posts, but overall the site would be much easier to manage.

I’d love to hear your thoughts on this. I won’t have time to start doing anything until December, so I’m not in a rush.

Google Pagerank 6 to 4! and 0 to 3 or 4!

Wow, my Google PageRank for the PlanetMike.com home page has dropped to 4 from 6, down from 7 a year ago. Speculation elsewhere on the web says that Google is penalizing people for using Text-Link-Ads. Well, if that’s the cause, hmmm, fine I guess. Up until this month, I made much more money from Text-Link-Ads than I ever did from Google Adsense. So could this be case of Google telling people to stay with us, don’t use any other advertising system?

Something interesting I just noticed, it seems that more of my inside pages have increased their page rank. So maybe Google is simply distributing the PageRank from the home page among all of the site’s internal pages?

Other sites discussing this:

Two of my other sites haven’t changed their PageRank, both are still at 5.

Shelfari.com Spam

The latest attempt to get into people’s webmail accounts is from a group Shelfari. Apparently set up as a service to help your friends learn about new books to read. They do this by getting you to give them your webmail account’s password so they can look to see if any of your friends are also members of Shelfari. In reality, Shelfari.com sends spam masquerading as you to everyone in your webmail’s Address Book.

See other reports of this behavior at:

I have blacklisted email from shelfari.com to my servers, as a complaint I filed there a few days was blatantly ignored and they did it again, emailed one of my addresses another “invite” to join.

Reminder: Never ever ever give your webmail account information to a third party.