Perl's text handling to the rescue....

There was a very interesting problem put forward at the Ingres forum, quoting:

"I have to use an Ingres Database to store my data which is in several languages (like french, german, and so on). In my web site, the user can use the function "search".
The problem are the special characters like éàèâ in french, or öäü in german. The user doesn't enter these characters, but it should be found anyway.

For example:
 
In the database is the word "château". The user types "chateau" (whithout â)
The program should find the "château", even if "chateau" was typed.
So all accents in the database should be replaced by something more
useful (like "_") 
Has someone an idea how to do that?
Ingres database version : 9.2.1 
 
Thanks a lot, Kakmael"

And this is my attempt to tackle it, using Perl, of course.


OK, so you get the Latin-1 encoded string "Chateau" from your web form and pass it to a Perl CGI script which does the following:

use strict;
use warnings;
use charnames ':full';

my $input_string = "Chateau";

# Plain character => list of accented characters to try in its place
my %mappings = (
    "\N{LATIN SMALL LETTER A}" => [
        "\N{LATIN SMALL LETTER A WITH GRAVE}",
        "\N{LATIN SMALL LETTER A WITH DIAERESIS}",
    ],
);

# Start with the string exactly as typed, quoted for SQL
my @results = ("'" . $input_string . "'");

foreach my $hash_key (keys %mappings) {
    foreach my $array_key (@{ $mappings{$hash_key} }) {
        # Copy the input, then replace the first occurrence only
        (my $temp = $input_string) =~ s/\Q$hash_key\E/$array_key/;
        push @results, "'" . $temp . "'";
    }
}

my $search_string;
{
    # $" is the separator used when an array is interpolated in a string
    local $" = ",";
    $search_string = 'SELECT * FROM test WHERE col2 in (' . "@results" . ')';
}

print $search_string;


Basically, you have a hash that maps each character to be replaced to its accented counterparts, which are stored in an anonymous array:

my %mappings=( "\N{LATIN SMALL LETTER A}" => ["\N{LATIN SMALL LETTER A WITH GRAVE}","\N{LATIN SMALL LETTER A WITH DIAERESIS}"]);

You then iterate through the nested data structure and substitute the sought-after character with each of its counterparts in turn (the two foreach loops). Finally, you build the final string with a neat trick: setting $" locally to a comma means that interpolating @results puts exactly the right number of commas between the values.
So the final $search_string will contain the list ('Chateau','Chàteau','Chäteau').
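
One thing the string-building above does not handle is a search term that itself contains a single quote (for example "l'eau"), which would break the quoted literal. A minimal sketch of one way around that, assuming the query is run through DBI with placeholders; the helper name build_in_query is hypothetical and not part of the original script:

```perl
use strict;
use warnings;

# Hypothetical helper: returns an SQL string with one "?" per variant,
# followed by the variants themselves as the list of bind values.
sub build_in_query {
    my @variants = @_;
    my $placeholders = join ',', ('?') x @variants;
    my $sql = "SELECT * FROM test WHERE col2 IN ($placeholders)";
    return ($sql, @variants);
}

my ($sql, @bind) = build_in_query("Chateau", "Ch\x{E0}teau", "Ch\x{E4}teau");
print "$sql\n";

# With a DBI handle in $dbh (connection details depend on your setup):
#   my $sth = $dbh->prepare($sql);
#   $sth->execute(@bind);
```

The values never become part of the SQL text, so no manual quoting is needed.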

Of course, this does not cover all possible cases (for example, do you want every 'a' replaced or just the first one?), since I don't know what the exact requirements are. In particular, the mapping above has no entry for 'â', so it needs one before "château" is actually found. It will need some tweaking, but you get the drift.
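
If every occurrence should be considered rather than just the first, one possible tweak is to build every combination of plain and accented characters. This is only a sketch under that assumption: the helper name all_variants is made up for illustration, and the mapping is extended with the circumflex so that "château" is among the results.

```perl
use strict;
use warnings;
use charnames ':full';

# Same idea as %mappings in the script above; extend it per language as needed.
my %mappings = (
    "a" => [
        "\N{LATIN SMALL LETTER A WITH GRAVE}",
        "\N{LATIN SMALL LETTER A WITH CIRCUMFLEX}",
        "\N{LATIN SMALL LETTER A WITH DIAERESIS}",
    ],
);

# Hypothetical helper: returns every spelling obtained by replacing any
# subset of the mapped characters. The unchanged input comes first.
sub all_variants {
    my ($input) = @_;
    my @variants = ('');
    for my $char (split //, $input) {
        # The character itself, plus its accented counterparts (if any)
        my @choices = ($char, @{ $mappings{$char} || [] });
        my @next;
        for my $prefix (@variants) {
            push @next, $prefix . $_ for @choices;
        }
        @variants = @next;
    }
    return @variants;
}

my @results = all_variants("Chateau");
print scalar(@results), " variants\n";   # 4 choices for each of the two a's: 16
```

The number of variants multiplies with every mapped character in the input, so for long search terms or large mappings the IN list grows quickly.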