Difference between revisions of "UTF8 Tools"

From Free Pascal wiki
Jump to navigationJump to search
Line 16: Line 16:
 
      
 
      
 
''Simple demo:''
 
''Simple demo:''
 +
<syntaxhighlight lang="pascal">
 
  fCES := TCharEncStream.Create;
 
  fCES := TCharEncStream.Create;
 
  fCES.LoadFromFile(OpenDialog1.FileName);
 
  fCES.LoadFromFile(OpenDialog1.FileName);
 
  Memo1.text := fCES.UTF8Text;   
 
  Memo1.text := fCES.UTF8Text;   
 
  fCES.free;
 
  fCES.free;
 
+
</syntaxhighlight>
  
  

Revision as of 22:44, 23 March 2012

About

Sharing some of my code


UTF-8 Tools

Purpose

Some tools for common problems with UTF-8 / Unicode.


  • charencstreams.pas: Load and save data from / to almost any text source like
    • ANSI, UTF8, UTF16, UTF32
    • big or little endian
    • BOM or no BOM

Simple demo:

 fCES := TCharEncStream.Create;
 fCES.LoadFromFile(OpenDialog1.FileName);
 Memo1.text := fCES.UTF8Text;  
 fCES.free;


  • character.pas: Get Information about code points using the TCharacter class.

Demo

if TCharacter.IsLetter(s[i]) then s[i] := TCharacter.toLower(s[i]);


  • utf8scanner.pas: Access UTF-8 strings by code index, use case statements on UTF-8 strings and more...

Index demo

s := TUTF8Scanner.Create(Memo1.text);
for i := 1 to s.Length do
if TCharacter.IsLetter(s[i]) then s[i] := TCharacter.toLower(s[i]);
Memo1.Text := s.UTF8String;
s.free;

Case demo

 s := TUTF8Scanner.Create(Memo1.text);
 s.FindChars := 'öäü';
 repeat
   case s.FindIndex(s.Next) of
 {ö} 0: s.Replace('oe');
 {ä} 1: s.Replace('ae');
 {ü} 2: s.Replace('ue');
   end;
 until s.Done;
 Memo1.Text := s.UTF8String;
 s.free;

Download

Download utf8tools.zip