I have a csv file with mime type US-ASCII and one column in the dataset look like this:
id | V_name |
---|---|
210001 | cha?ne des Puys |
210030 | M?los |
213004 | G?ll? |
213021 | S?phan |
221110 | Afd?ra |
And so on.
I would like to change those characters to:
id | V_name |
---|---|
210001 | chaine des Puys |
210030 | Milos |
213004 | Gollu |
213021 | Suphan |
221110 | Afdera |
The thing is that there are 95 rows of this kind, how can I search and replace those rows? I using the suite PDI spoon. Thanks in advance.
As @Iłya Bursov has stated, the source file you are reading doesn't provide the correct characters, it is providing the ? in the source, so if you want to correct it, you'll have to do it manually.
I don't think it is worth it, unless you know you are going to get always the same set of V_name over time and different files. In that case you could create a file to correlate the V_name in your source with the ? characters to a V_name_corrected with the correct display for the characters. This seems to be an exercise, so I would let the names as they are. In real life, I would contact the provider of the file with the incorrect character set to tell them that they need to correct the generation of the file to provide the correct characters in the file.