So I've inherited this database with some invalid xml data in one table. I know for a fact that the character is a control char and I can find it in the column (which is of type XML):
The statement fails with message: "XML parser error: character: 603, line:1, column:603 Illegal control character SQLCODE=-888, ODBC 3 State="HY000" Any ideas how to get rid of that character? (Running SQLA 12) Thank you! EDIT Looks like Volker's suggestion to first update the BLA_COLUMN to a Null value then to update to a valid XML, replacing char(26) with char (32), provides a workaround. Thanks Volker! asked 05 Jun '14, 06:05 tzup Volker Barth |
Is the same true if you try to set the column to null (which would look like the previous contents is parsed, too)...? EDIT: As this seems to have done the trick, it shows that
answered 05 Jun '14, 07:48 Volker Barth If that is happening it's hard to see how you would ever be able to fix it - it's odd that it could get in there in the first place unless the behaviour has been changed at some time and it is now stricter. If this is the case, it's beginning to look more like a bug than a feature :)
(05 Jun '14, 08:33)
Justin Willey
1
Setting the column to Null first, then updating it with a valid XML (ie char 26 replaced with char 32) seems to do the trick.
(05 Jun '14, 08:45)
tzup
2
I had a go reproducing the problem (on 10.0.1 and 16.0) but couldn't - I could select and update the bad XML without an issue - so I was wondering what was causing the problem. Anyway, you have a work around so that's great.
(05 Jun '14, 10:00)
Justin Willey
Replies hidden
1
12.0.1.4085 shows the same success. Also trying to turn a valid XML into an invalid one doesn't raise an error, such as:
(05 Jun '14, 10:39)
Volker Barth
3
SQL Anywhere only validates XML when parsing it (for example, using OPENXML). It is not validated when inserting or updating a value, or casting to the XML type. I wonder if the original database has a trigger, computed column, or check constraint that ends up parsing the XML.
(05 Jun '14, 14:31)
Ivan T. Bowman
Replies hidden
1
You guys are right. Having looked more carefully at the table, I've notice the presence of a trigger that indeed does some parsing on update (disabling the trigger allowed the updates of fixing the xml just fine). Can't believe that didn't cross my mind! So there you go, case closed. Thank you!
(06 Jun '14, 00:00)
tzup
|
I think you need to convert the xml data to something that SQLA doesn't try to interpret. eg (untested) create variable mytext long varchar; set mytext = (SELECT (BLA_COLUMN) FROM BLA_TABLE where ID_RECORD = 1234); set mytext= REPLACE(mytext, char(26), char(32)); update BLA_TABLE set BLA_COLUMN = mytext where ID_RECORD = 1234; commit; If you have to do this a lot you could write a user defined function. Also if you have characters which have problems because of collation tables eg update person set notes=replace(notes,'ú','£')and every u in the field will also be changed as the collation sequence helpfully reckons that 'ú' is the same as 'u', you can use this approach: create function BinaryReplace(in x long varchar,in targetascii smallint, in replacementascii smallint) returns long varchar deterministic begin declare rv long varchar; declare l integer; declare i integer; declare c char(1); set l=length(x); set i=0; set rv=''; while i < l loop set i=i+1; set c=substr(x,i,1); if ascii(c) = targetascii then set c=char(replacementascii) end if; set rv=rv+c end loop; return rv end; not fast, but quicker than typing! answered 05 Jun '14, 06:47 Justin Willey Weird! I can't even run a statement like: UPDATE BLA_TABLE set BLA_COLUMN = '<bla/>' where ID_RECORD=1234; without the XML parser complaining as above
(05 Jun '14, 07:43)
tzup
Replies hidden
Is Breck's method allowed to work without the parser complaining?
(05 Jun '14, 08:25)
Justin Willey
|
Also untested...
Same error as above unfortunately
Do you have a support contract with SAP - it's beginning to look like you may need to open a case.
But in the meantime: The docs say: You can cast between the XML data type and any other data type that can be cast to or from a string. Note that there is no checking that the string is well-formed when it is cast to XML.
So it's looking a bit like you can write in anything, but can only read out well formed XML.
To check this can you try just selecting the XML into a string without trying to write anything back - I'm just wondering if there is more than one illegal character which is why Breck's method fails.
ie something like
OR
and see if you can then read what is in mytext.
Reading works fine in both cases. The updating with a valid XML proved to be tricky. (Please see my Edit)