pythonyamlpyyaml

How to read a python tuple using PyYAML?


I have the following YAML file named input.yaml:

cities:
  1: [0,0]
  2: [4,0]
  3: [0,4]
  4: [4,4]
  5: [2,2]
  6: [6,2]
highways:
  - [1,2]
  - [1,3]
  - [1,5]
  - [2,4]
  - [3,4]
  - [5,4]
start: 1
end: 4

I'm loading it using PyYAML and printing the result as follows:

import yaml

f = open("input.yaml", "r")
data = yaml.load(f)
f.close()

print(data)

The result is the following data structure:

{ 'cities': { 1: [0, 0]
            , 2: [4, 0]
            , 3: [0, 4]
            , 4: [4, 4]
            , 5: [2, 2]
            , 6: [6, 2]
            }
, 'highways': [ [1, 2]
              , [1, 3]
              , [1, 5]
              , [2, 4]
              , [3, 4]
              , [5, 4]
              ]
, 'start': 1
, 'end': 4
}

As you can see, each city and highway is represented as a list. However, I want them to be represented as a tuple. Hence, I manually convert them into tuples using comprehensions:

import yaml

f = open("input.yaml", "r")
data = yaml.load(f)
f.close()

data["cities"] = {k: tuple(v) for k, v in data["cities"].items()}
data["highways"] = [tuple(v) for v in data["highways"]]

print(data)

However, this seems like a hack. Is there some way to instruct PyYAML to directly read them as tuples instead of lists?


Solution

  • I wouldn't call what you've done hacky for what you are trying to do. Your alternative approach from my understanding is to make use of python-specific tags in your YAML file so it is represented appropriately when loading the yaml file. However, this requires you modifying your yaml file which, if huge, is probably going to be pretty irritating and not ideal.

    Look at the PyYaml doc that further illustrates this. Ultimately you want to place a !!python/tuple in front of your structure that you want to represented as such. To take your sample data, it would like:

    YAML FILE:

    cities:
      1: !!python/tuple [0,0]
      2: !!python/tuple [4,0]
      3: !!python/tuple [0,4]
      4: !!python/tuple [4,4]
      5: !!python/tuple [2,2]
      6: !!python/tuple [6,2]
    highways:
      - !!python/tuple [1,2]
      - !!python/tuple [1,3]
      - !!python/tuple [1,5]
      - !!python/tuple [2,4]
      - !!python/tuple [3,4]
      - !!python/tuple [5,4]
    start: 1
    end: 4
    

    Sample code:

    import yaml
    
    with open('y.yaml') as f:
        d = yaml.load(f.read())
    
    print(d)
    

    Which will output:

    {'cities': {1: (0, 0), 2: (4, 0), 3: (0, 4), 4: (4, 4), 5: (2, 2), 6: (6, 2)}, 'start': 1, 'end': 4, 'highways': [(1, 2), (1, 3), (1, 5), (2, 4), (3, 4), (5, 4)]}