Hi programmers,
I want read line by line a Unicode (UTF-8) text file created by Notepad, i don't want display the Unicode string in the screen, i want just read and compare the strings!.
This code read ANSI file line by line, and compare the strings
What i want
read_ansi_line_ by_line.c
test_ansi.txt
Compiling
Output
Now i need read Unicode (UTF-8) file created by Notepad, after more than 6 months i don't found any good code/library in C can read file coded in UTF-8!, i don't know exactly why but i think the standard C don't support Unicode!
Reading Unicode binary file its OK!, but the probleme is the binary file most be already created in binary mode!, that mean if we want read a Unicode (UTF-8) file created by Notepad we need to translate it from UTF-8 file to BINARY file!
This code write Unicode string to a binary file, NOTE the C file is coded in UTF-8 and compiled by GCC
What i want
create_bin.c
Compiling
Output
Now i want read the binary file line by line and compare!
What i want
read_bin_line_b y_line.c
Compiling
Output
THE PROBLEM
This method is VERY LONG! and NOT POWERFUL (i m beginner in software engineering)
Please any one know how to read Unicode file ? (i know its not easy!) Please any one know how to convert Unicode file to Binary file ? (simple method) Please any one know how to read Unicode file in binary mode ? (i m not sure)
Thank You.
I want read line by line a Unicode (UTF-8) text file created by Notepad, i don't want display the Unicode string in the screen, i want just read and compare the strings!.
This code read ANSI file line by line, and compare the strings
What i want
- Read test_ansi.txt line by line
- if the line = "b" print "YES!"
- else print "NO!"
read_ansi_line_ by_line.c
Code:
#include <stdio.h> int main() { char *inname = "test_ansi.txt"; FILE *infile; char line_buffer[BUFSIZ]; /* BUFSIZ is defined if you include stdio.h */ char line_number; infile = fopen(inname, "r"); if (!infile) { printf("\nfile '%s' not found\n", inname); return 0; } printf("\n%s\n\n", inname); line_number = 0; while (fgets(line_buffer, sizeof(line_buffer), infile)) { ++line_number; /* note that the newline is in the buffer */ if (strcmp("b\n", line_buffer) == 0 ){ printf("%d: YES!\n", line_number); }else{ printf("%d: NO!\n", line_number,line_buffer); } } printf("\n\nTotal: %d\n", line_number); return 0; }
Code:
a b c
Code:
gcc -o read_ansi_line_by_line read_ansi_line_by_line.c
Code:
test_ansi.txt 1: NO! 2: YES! 3: NO! Total: 3
Reading Unicode binary file its OK!, but the probleme is the binary file most be already created in binary mode!, that mean if we want read a Unicode (UTF-8) file created by Notepad we need to translate it from UTF-8 file to BINARY file!
This code write Unicode string to a binary file, NOTE the C file is coded in UTF-8 and compiled by GCC
What i want
- Write the Unicode char "ب" to test_bin.dat
create_bin.c
Code:
#define UNICODE #ifdef UNICODE #define _UNICODE #else #define _MBCS #endif #include <stdio.h> #include <wchar.h> int main() { /*Data to be stored in file*/ wchar_t line_buffer[BUFSIZ]=L"ب"; /*Opening file for writing in binary mode*/ FILE *infile=fopen("test_bin.dat","wb"); /*Writing data to file*/ fwrite(line_buffer, 1, 13, infile); /*Closing File*/ fclose(infile); return 0; }
Code:
gcc -o create_bin create_bin.c
Code:
create test_bin.dat
Now i want read the binary file line by line and compare!
What i want
- Read test_bin.dat line by line
- if the line = "ب" print "YES!"
- else print "NO!"
read_bin_line_b y_line.c
Code:
#define UNICODE #ifdef UNICODE #define _UNICODE #else #define _MBCS #endif #include <stdio.h> #include <wchar.h> int main() { wchar_t *inname = L"test_bin.dat"; FILE *infile; wchar_t line_buffer[BUFSIZ]; /* BUFSIZ is defined if you include stdio.h */ infile = _wfopen(inname,L"rb"); if (!infile) { wprintf(L"\nfile '%s' not found\n", inname); return 0; } wprintf(L"\n%s\n\n", inname); /*Reading data from file into temporary buffer*/ while (fread(line_buffer,1,13,infile)) { /* note that the newline is in the buffer */ if ( wcscmp ( L"ب" , line_buffer ) == 0 ){ wprintf(L"YES!\n"); }else{ wprintf(L"NO!\n", line_buffer); } } /*Closing File*/ fclose(infile); return 0; }
Code:
gcc -o read_bin_line_by_line read_bin_line_by_line.c
Code:
test_bin.dat YES!
This method is VERY LONG! and NOT POWERFUL (i m beginner in software engineering)
Please any one know how to read Unicode file ? (i know its not easy!) Please any one know how to convert Unicode file to Binary file ? (simple method) Please any one know how to read Unicode file in binary mode ? (i m not sure)
Thank You.
Comment