pythonpandas

Pandas DataFrame column naming conventions


Is there any commonly used Pandas DataFrame column naming convention? Is PEP8 recommended here (ex. instance variables)?

Concious that lots of data is loaded from external sources with headers but I'm curious what is the correct approach when I have to name/rename the columns on my own?


Solution

  • Some people tend to use snake_case (lower case with underscores) so that they can access the column using period like this df.my_column

    I tend to always access columns using the df['my_column'] syntax because it avoids confusion with DataFrame methods and properties, and it easier to extend to slices and fancy indexing, so the snake case is not necessary.

    In short, I think you should use whatever is clearest to a potential reader.