Channel: Active questions tagged utf-8 - Stack Overflow

↧

PHP multibyte regex not working with UTF-8 [duplicate]

May 29, 2024, 5:01 am

≫ Next: UnicodeDecodeError: 'utf-8' when debugging Python files in PyCharm Community

≪ Previous: String in utf8 format, i need to make it normal text

I have UTF-8 string that I want to search for all occurrences of img_(\d+).I have tried original

$pattern = '/img_(\d+)/u';preg_match_all($pattern, $text, $matches, PREG_OFFSET_CAPTURE);

but it gives me wrong offsets for the patterns.

I have also tried:

mb_internal_encoding('UTF-8');$pattern = 'img_(\d+)';mb_ereg_search_init($content, $pattern);$matches = [];        while ($result = mb_ereg_search_regs()) {    $matches[] = ['match' => $result[0],'offset' => mb_ereg_search_getpos() - mb_strlen($result[0]),    ];}

but it gives me the same result as preg_match_all.

However, when I run manually search with this:

$pos = mb_strpos($content, "img_1", 0);

I got correct offset.

Example code:

$str = "přílišžluťoučký img_1 kůn úpěl ďábelskéódy";$pattern = '/img_(\d+)/u';preg_match_all($pattern, $str, $matches, PREG_OFFSET_CAPTURE);print_r($matches); //gives 24 (wrong)echo mb_strpos($str, "img_1", 0); //gives 17 (correct)

How to fix this?

↧

Trending Articles

Bath man appears in court charged with attempted murder of a man...

March 16, 2015, 7:37 am

MACLEAN, Allan

July 30, 2019, 6:00 am

Black Angus Grilled Artichokes

July 16, 2016, 4:37 pm

Practice Sheet of Right form of verbs for HSC Students

September 22, 2019, 11:40 pm

Police blotter for Jan. 12

January 12, 2018, 3:30 am

99 God Status for Whatsapp, Facebook

June 5, 2016, 11:46 pm

Rajasthan Board 12th Science Result 2018 name wise- RBSE 12th commerce result...

May 26, 2018, 9:35 pm

Notorious Naushad of Ippa gang nabbed

July 19, 2019, 6:37 am

Child Kidnapping: Amy McNeil was kidnapped on her way to school by 5 adults;...

February 5, 2017, 10:40 am

Sonible Smartlimit v1.1.5-R2R

April 16, 2024, 7:10 am

NCERT Solutions for Class 9th Sanskrit Chapter 3 पाथेयम्

December 22, 2016, 3:50 am

मतलबी दोस्त स्टेट्स | Matlabi Dost Status in Hindi – Selfish Friends Status

February 13, 2020, 3:12 am

Arrow Flash 2 – Sinhala Dubbed – Episode 23 – 20th March 2016

March 20, 2016, 9:39 am

[GET] AI Traffic Goldmine

July 6, 2025, 4:23 am

[E² Plugin] HDF-Radio

January 26, 2025, 9:02 am

Universal Multi-Patch v1.3 By RADIXX11

January 29, 2018, 2:45 pm

IWAN – Thanks and Praise ( Throw Back Thursday )

March 9, 2016, 11:43 pm

RONALD P SONDERGAARD Arrested by Miami-Dade County Corrections on Mar 03, 2017

March 3, 2017, 6:25 am

मुख मैथुन से उठाएं सेक्स का भरपूर मज़ा, जानें क्या है इसका सही तरीकामुख मैथुन...

May 17, 2020, 2:04 pm

HSSC Excise & Taxation Inspector Result 2017 Scorecard/ Category Wise Merit List

July 29, 2017, 2:44 am

© 2025 //www.rssing.com