House Auction

When I returned from work ten days ago I was surprised to see a sticker binding my house gate together. I thought that my landlord didn’t pay the house installments although I never missed to pay the rent.

It was a relief to learn that it was just an advertisement. If I see the person who sticks this on my gate I would have kicked him in the crotch.

OCRopus and Tesseract

A friend pointed me to an open source project called OCRopus because I am currently working on a project related to OCR. Commercial OCR solutions ain’t cheap and you can really dig a hole in your pocket trying to get a good OCR solution. It’s neither the price of the hardware nor the software that is high but the amount of work that needs to be done to make sure a correct output is obtained.

Most OCR solutions need a vast amount of time to train the software to correctly identify characters. Artificial Intelligence can help but not now, not today, not yet.

OCRopus is not the one who recognize the character itself but it relies on Tesseract. OCRopus provides layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities. Sounds really good doesn’t it? Tesseract is the OCR engine that OCRopus uses.

Most of the project is tested and developed on Ubuntu, but if your platform has binutils and build tools you’re good to go. I believe it is also possible to build using Microsoft Visual Studio on Windows and of course MingGW. I went for the easiest option since I only have 2 hours to spare and I already have Cygwin on my system.

I first installed libraries header files (libpng-devel, libtiff-devel, libjpeg-devel) and build tools (gcc, make, g++, autoconf) and then built tesseract with the normal ./configure && make && make install method. To build OCRopus there is a need for Perforce Jam. Jam is actually Just Another Make. I find it a little funny when I have to build Jam using make. Oh well. OCRopus is built with ./configure && jam && jam install and it went pretty well.

To run them don’t forget to download the language files for your target language otherwise it will complain: Unable to load unicharset file /usr/local/share/tessdata/eng.unicharset

I ran my tests with standard LUA scripts that came with OCRopus (located in /usr/local/share/ocropus/scripts/) with the command ocroscript.exe rec-tess input_image > output.html

I created a 10 line Word Document with different fonts and printed it to a PDF. Using Adobe Photoshop I saved it to a JPG image. Then I gradually resized the image to the smallest I can get some output with.

To see the tests and results, click on Continue Reading.

Continue reading OCRopus and Tesseract

MMU For Sale

Yeah, you heard it right. Multimedia University (MMU) is up for sale, and it’s valued between RM800~900 million. The plan is to raise money for the high-speed broadband (HSBB) project which will cost TM RM1~1.5 billion per year.

Here’s an article from Business Times Online.

From the line While management remains optimistic on the disposal of staff loans, the group is struggling to find a buyer for the university,” the duo noted., it’s now confirmed that MMU is for sale!

I was alerted of this news while reading Amanz.

Dirty Promotion Techniques By Mobile Content Providers

More on unwanted text messages via cellphones.

I received a text message from 012 835 3906:

“Jon, video U dgn Jane disebarkan. Cuba htr ON MX ke 36660 skrg. Free reg,tolong jgn pass ke org.Very urgent pls help! Jane call i tadi sampai kelam kabut takut.”

Translation: “Jon, your video with Jane has been leaked. Try send ON MX to 36660 now. Free registration, please do not pass to others. Very urgent! Jane just called me, she is scared and freaked out.”

I remembered hearing something about ON MX in a TV commercial. This is one dirty trick where a pervert or someone looking for “interesting” videos would send in the command and be subscribed to this particular content provider.

I admit the company have a good idea to do this scam, but then again it’s still a dirty trick and I disapprove.

I really hope Maxis or any other telcos will take action to prevent these kind of trick to the consumers. Then again, it’s Malaysia. I tried sending an SMS-to-email via Maxis’ 1503 gateway and have yet to receive anything. 😐

The Most Expensive “Baju Melayu” I Ever Had

Or frankly the title should be “The Most Expensive Baju Melayu I will have” because I have just sent the fabric to the tailor a couple of hours ago at Wisma Yakin near Jalan Tuanku Abdul Rahman. The thing is I am unsure whether I will get a more expensive ones later in life.

For those who are wondering what it is, go here for Wikipedia page on Baju Melayu.

I am not getting this one especially for the coming Eid ul-Fitr, but for my niece’s wedding ceremony in August. And since it is pricey I will also use it for Eid.

The fabric is RM200 and the tailor charge is RM120. It maybe normal for certain people but since all of my previous Baju Melayus were tailored by my mom and using normal fabrics this is the most expensive one for me.

The reason I didn’t go for ready-made ones is that they might not fit well, and the ones I found are either too cheap (equals bad quality) or too expensive. The turquoise Baju Melayu will be ready during the last week of July, just in time for the wedding on August 8th.

MasterCard/Visa Promotion Fraud Attempt #2

Back in January I wrote about an attempt to squeeze my credit card numbers by a caller using a private number.

On July 4th, I received a call from 016 336 8916 but since I have my phone on private mode they were not able to reach me. However a couple of minutes later they sent me a text message: “Hello mr/mrs Ady Romantika. I’m Ros from Visa/master card voucher department. Because of your loyalty to us, you are entitled for complimentary vouchers. Please come to our office with your spouse to collect your vouchers. Unit 515, Level 5, Block E, Phileo Damansara 1, No. 9, Jalan 16/11, Off Jalan Damansara, 46350 Petaling Jaya Selangor. Please come anytime from Monday to Sunday between 3pm to 8pm. Please also allow us at least 45mins of your time. Thank you and see you soon.”

It was a long text message indeed. I sent them a reply that they are scammers, and they were bold enough to reply me. Their message now have been sent to the Royal Malaysian Police and the media via email. I am unsure whether any action will be taken to investigate it but I shall wait and see.

As I mentioned in my previous post, there is no logical chance that Mastercard and Visa are running a promotion together. I am pretty sure if I show up they will ask me for my credit cards and take the chance to copy the numbers, expiries, and CVV/CV2 numbers. Then they are free to use my credit cards online.

For the less cautious this might be a trap they might easily fall into. Beware!

Compressing WordPress Output

While toying around with NextGen code so that I can activate my custom image mirror, I saw the output from Firebug. I noticed that my HTML output is not compressed (by the absence of gzip content-encoding).

Some Apache servers have this module already enabled (previously mod_gzip a 3rd party module in Apache 1, and now built-in in Apache 2 as mod_deflate).

But what if you don’t have access to the Apache configuration, such as in a shared hosting environment?

I have the answer for PHP. I always include this line in the bootstrap code of the applications I build using Zend Framework:


And the output will be gzipped prior to sending it to the browser. The result? Faster transfer to users.

For WordPress you can put the line in index.php:

< ?php ob_start("ob_gzhandler"); /* Short and sweet */ define('WP_USE_THEMES', true); require('./wp-blog-header.php'); ?>

Easy, isn’t it? Here are Firebug screenshots, before and after. Notice that I managed to cut the size of my front page by 1/5?

[As the screenshots are too wide please click on Continue Reading to see them]
Continue reading Compressing WordPress Output

PHP Framework Benchmark

In April I wrote about Eclipse PDT, Zend Framework, PHPUnit.

AVNet Labs have executed a comprehensive benchmark against popular PHP Frameworks.

It looks like they are also using Zend Framework for their development. I’ll stay with Zend as well, because I believe in vendor-product compatibility. I will not ask for support from Adobe if I have a problem with Microsoft Visual Studio, so it’s the same concept here.

Zend is The PHP Company.

Thanks to Rizal for the heads up.

Chitika Oh Chitika

On 10 April 2007 I tried to apply for Chitika eMiniMalls just to try out my luck even though my number of visitors is much lower back then. I received a reply which I totally understand and accepted:

In an effort to bring value to our publishers, we carefully consider each submission. During our review process we have determined that Chitika | eMiniMalls might not be a good match for your website.

On 27 June 2008 I received an email from Chitika:

Hi Ady,

Great news! There have been a lot of changes over at Chitika recently – so although our ads were not a good match for your website in the past, we believe that we may be a much better fit for you now. Why? Because the Chitika network now serves Premium ads for ALL types of site content like: Finance, Health, Travel, Family, & more. (Previously we focused mainly on product-related websites.)

We now offer a LOT more than eMiniMalls too – our new Chitika|Premium ads target your search traffic, and are showing extremely high CTRs and eCPMs for our publishers. So if you have a good amount of US search traffic, Chitika|Premium will be a great fit for your site!

Re-open your Chitika application here, to get started. (You will be able to edit your information such as website, email, and PayPal info before you submit) or head over to the “Chitika | Premium” page for more information.

And I received a reply:


The email address you used doesn’t match the domain that you submitted, and it also does not match the email that was used to register the domain, so we cannot tell if you actually own this website.

If you do own this site, please re-open your application using the link below and supply an email address from the domain, or the email address that was used to register the domain.

If you cannot do this, then please re-open your application and tell us why in the “Comments” field. Thanks. Looking forward to your comments.

This is because I used my GMail address for the registration. If I enter my email address for this blog domain I immediately get “Invalid Email ID!”. I stated in the comment field of the registration form but I guess it was not taken into anyone’s attention. But then again, the reply did mention “If you cannot do this, then please re-open your application and tell us why in the “Comments” field. Thanks. Looking forward to your comments.” WTH?

Not that Chitika is bad or anything but as an Internet user I expect an established site updates the list of TLD frequently. I also had the email not matching problem with Nuffnang but they received my application without any problem.

United Airlines Versus Malaysia Airlines

I am comparing 2 very different companies, I know. Malaysia Airlines or frequently called MAS is the first Malaysian airline company established in 1937 as Malayan Airways Limited. MAS currently have 81 aircraft in its fleet.

United Airlines is one of the US’s major airline, established in 1926 as Boeing Air Transport. United have 456 aircraft in its fleet.

I am not comparing both airlines either in-flight services or in company size. I don’t really want to talk about airlines but in services in general. I have to admit in terms of comfort and in-flight services I like MAS better 😉 That’s different. MAS have high standards for in-flight services.

While I was in the US, I tried to claim miles for MAS’s Enrich and United’s Mileage Plus. Who knows in the near future I might be able to utilize my points.

On May 21st, I tried to submit my claims for both airlines using their online missing miles utilities and both failed because for some reason my first/last name pairs got mixed up during the ticket purchase. Thanks Marco Polo travels, “good” job.

So I sent them an email each. It took 2 days for United to reply my mail and ask me for details. I sent them the details and by May 24th I received my United miles for the Los Angeles to Boston trip.

MAS never replied until June 16th. That’s almost a month. I was disappointed and forgot about it until yesterday.

So I sent them an email yesterday with my details for both trips (to and fro Los Angeles). I’ll wait and see how long it will take. I also sent an email to United to claim for my return trip from Boston to Los Angeles, and this morning at 10:20am I received my points.

So you see, Malaysian companies likes to neglect good customer service. Simply because there is not so many choices. We have 1 electric company, 1 land-line phone company, 1 consumer Internet provider, and few of each other important services. Monopoly is the keyword in Malaysia, minorities don’t even have a chance. The airline story is just a simple example.

While in the States I was quite surprised with the level of attention and service I received. It was really a good experience. I am not exaggerating, seriously.

I think this is simply because the size of the country, and the number of competitions they have over there. Everyone have to strive to be the best, and in the end everyone provides the best they can.

What do you think?