此mbrtowc()
函数用于将多字节序列转换为宽字符串。这将返回多字节字符的长度(以字节为单位)。语法如下。
mbrtowc (wchar_t* wc, const char* s, size_t max, mbstate_t* ps)
参数是-
wc是指向结果宽字符存储位置的指针。
s是指向多字节字符串作为输入的指针
max是s中可以检查的最大字节数
ps在解释多字节字符串时指向转换状态。
#include <bits/stdc++.h> using namespace std; void display(const char* s) { mbstate_t ps = mbstate_t(); // initial state int s_len = strlen(s); const char* n = s + s_len; int len; wchar_t wide_char; while ((len = mbrtowc(&wide_char, s, n - s, &ps)) > 0) { wcout << "The following " << len << " bytes are for the character " << wide_char << '\n'; s += len; } } main() { setlocale(LC_ALL, "en_US.utf8"); const char* str = u8"z\u00cf\u7c38\U00000915"; display(str); }
输出结果
The following 1 bytes are for the character z The following 2 bytes are for the character Ï The following 3 bytes are for the character 簸 The following 3 bytes are for the character क