IATI Consultations Archive

Live discussions and consultations can be found at discuss.iatistandard.org.

Fixing invalid XML Unicode issues - 0x5 (issue with some World Bank Files)

Some of the World Bank XML files currently in the registry include the control character U+0005 (0x5), which is not a valid XML 1.0 character, so many parsing tools will reject these files or fail to process them.
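To check whether a given file is affected before trying to parse it, something like the following rough sketch may help. The function name findEnqOffsets is made up for illustration, and it assumes the file is UTF-8 (or another ASCII-compatible encoding), where the byte 0x05 can only ever mean the character U+0005.

```php
<?php
// Hypothetical helper (not part of any library): returns the byte offsets
// of every 0x05 byte in $data. Assumes an ASCII-compatible encoding such
// as UTF-8, where byte 0x05 never occurs inside a multi-byte sequence.
function findEnqOffsets(string $data): array
{
    $offsets = [];
    $pos = 0;
    // strpos() returns false once there are no further matches.
    while (($pos = strpos($data, "\x05", $pos)) !== false) {
        $offsets[] = $pos;
        $pos++; // continue searching just after this match
    }
    return $offsets;
}
```

An empty array means the file does not contain the character; otherwise the offsets show where in the file to look.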

In PHP I've dealt with this by running the contents of each file through a simple search and replace:

$data = preg_replace("#[\x5]#","",$data); 

There are some details about 0x5 and the XML Specification here that helped me identify the issue: http://osdir.com/ml/text.xml.exist/2004-10/msg00123.html
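If other control characters turn up in the same files, the same idea can be widened to strip everything outside the character ranges that the XML 1.0 specification allows (#x9, #xA, #xD, #x20-#xD7FF, #xE000-#xFFFD, #x10000-#x10FFFF). This is only a sketch: the function name stripInvalidXmlChars is made up for illustration, and it assumes the data is valid UTF-8.

```php
<?php
// Hypothetical helper (not part of any library): removes every code point
// that falls outside the XML 1.0 "Char" ranges.
// Assumes $data is valid UTF-8; with the /u modifier preg_replace()
// returns null if the input is not valid UTF-8.
function stripInvalidXmlChars(string $data): ?string
{
    return preg_replace(
        '/[^\x{9}\x{A}\x{D}\x{20}-\x{D7FF}\x{E000}-\x{FFFD}\x{10000}-\x{10FFFF}]/u',
        '',
        $data
    );
}
```

Tabs, line feeds and carriage returns are kept because XML allows them. A null return value would suggest the file has an encoding problem as well, which needs handling separately.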

 

