pythondata-fitting

Finding the point of a slope change as a free parameter


Say I have two lists of data as follows:

x = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
y = [1, 2, 3, 4, 5, 6, 8, 10, 12, 14]

That is, it's pretty clear that merely fitting a line to this data doesn't work, but instead the slope changed at a point in the data. (Obviously, one can pinpoint from this data set pretty easily where that change is, but it's not as clear in the set I'm working with so let's ignore that.) Something with the derivative, I'm guessing, but the point here is I want to treat this as a free parameter where I say "it's this point, +/- this uncertainty, and here is the linear slope before and after this point."

Note, I can do this with an array if it's easier. Thanks!


Solution

  • Here is a plot of your data:

    enter image description here

    You need to find two slopes (== taking two derivatives). First, find the slope between every two points (using numpy):

    import numpy as np 
    x = np.array([1, 2, 3, 4, 5, 6, 7, 8, 9, 10],dtype=np.float)
    y = np.array([1, 2, 3, 4, 5, 6, 8, 10, 12, 14],dtype=np.float)
    m = np.diff(y)/np.diff(x)
    print (m)
    # [ 1.  1.  1.  1.  1.  2.  2.  2.  2.]
    

    Clearly, slope changes from 1 to 2 in the sixth interval (between sixth and seventh points). Then take the derivative of this array, which tells you when the slope changes:

    print (np.diff(m))
    [ 0.  0.  0.  0.  1.  0.  0.  0.]
    

    To find the index of the non-zero value:

    idx = np.nonzero(np.diff(m))[0]
    print (idx)
    # 4
    

    Since we took one derivative with respect to x, and indices start from zero in Python, idx+2 tells you that the slope is different before and after the sixth point.