pythonxmlpython-requests

The XML file I download using requests.get is different from when I download using firefox. What am I doing wrong?


Link: https://www.sec.gov/Archives/edgar/data/1965040/000112760224010272/

The file I am downloading is form4.xml

The code I use is:

import requests
headers = {
    'User-Agent': 'sec_datascrape',
    'From': 'myemail@gmail.com',
    'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
}
url = "https://www.sec.gov/Archives/edgar/data/1965040/000112760224010272/form4.xml"

r = requests.get(filing_href, headers=headers)
with open('data.xml', 'wb') as f:
        f.write(r.content)

The content I get from the code above is very different from when I download directly using the link from the Firefox web browser. Not sure what I am doing wrong.

A snippet of the data I get:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<meta http-equiv="Last-Modified" content="Fri, 15 Mar 2024 20:49:57 GMT" />
<title>EDGAR Filing Documents for 0001127602-24-010272</title>
<link  rel="stylesheet" href="/edgar/search/global/css/bootstrap/bootstrap.min.css" type="text/css" />
<link rel="stylesheet" type="text/css" href="/include/interactive2.css" />
</head>
<body style="margin: 0; font-size: 16px; ">
<!-- SEC Web Analytics - For information please visit: https://www.sec.gov/privacy.htm#collectedinfo -->
<noscript><iframe src="//www.googletagmanager.com/ns.html?id=GTM-TD3BKV"
height="0" width="0" style="display:none;visibility:hidden"></iframe></noscript>
<script>(function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':
new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],
j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src=
'//www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);
})(window,document,'script','dataLayer','GTM-TD3BKV');</script>
<!-- End SEC Web Analytics -->
<noscript><div style="color:red; font-weight:bold; text-align:center;">This page uses Javascript. Your browser either doesn't support Javascript or you have it turned off. To see this page as it is meant to appear please use a Javascript enabled browser.</div></noscript>
<!-- BEGIN BANNER -->
<div  id="header" style="text-align: center;">
   <nav id="main-navbar" class="navbar navbar-expand">
      <ul class="navbar-nav">
         <li class="nav-item">
            <a class="nav__sec_link" href="https://www.sec.gov">
               <img src="/edgar/search/images/edgar-logo-2x.png" alt="" style="height:6.25rem">
            </a>
         </li>
         <li class="nav-item">
            <a class="nav__sec_link" href="https://www.sec.gov">
               <span class="link-text d-inline">SEC.gov</span>
            </a>
         </li>
         <li class="nav-item">
            <a class="nav__link" href="//www.sec.gov/submit-filings/about-edgar" id="edgar-short-form"><span class="link-text">EDGAR</span></a>
         </li>
      </ul>

      <ul class="navbar-nav ml-auto">
         <li class="nav-item">
            <a href="/cgi-bin/browse-edgar?action=getcurrent" class="nav__link">Latest Filings</a> 
         </li>
         <li class="nav-item">
            <a href="https://www.sec.gov/edgar/search-and-access" class="nav__link">Filings search tools</a>
         </li>
      </ul>
   </nav>
   <div style="position: absolute;width: 100%;"> <h1 style="position: relative;top: -60px;">Filing Detail</h1></div>
</div>
<!-- END BANNER -->


<!-- BEGIN BREADCRUMBS -->
<div id="breadCrumbs">
   <ul>
      <li><a href="/index.htm">SEC Home</a> &#187;</li>
      <li><a href="/edgar/searchedgar/companysearch.html">Company Search</a> &#187;</li>
      <li class="last">Current Page</li>
   </ul>
</div>
<!-- END BREADCRUMBS -->

What I am expecting:

<?xml version="1.0"?>
<ownershipDocument>

    <schemaVersion>X0508</schemaVersion>

    <documentType>4</documentType>

    <periodOfReport>2024-03-13</periodOfReport>

    <issuer>
        <issuerCik>0001965040</issuerCik>
        <issuerName>Fortrea Holdings Inc.</issuerName>
        <issuerTradingSymbol>FTRE</issuerTradingSymbol>
    </issuer>

    <reportingOwner>
        <reportingOwnerId>
            <rptOwnerCik>0001983802</rptOwnerCik>
            <rptOwnerName>Warren Amanda M.</rptOwnerName>
        </reportingOwnerId>
        <reportingOwnerAddress>
            <rptOwnerStreet1>8 MOORE DRIVE</rptOwnerStreet1>
            <rptOwnerStreet2></rptOwnerStreet2>
            <rptOwnerCity>DURHAM</rptOwnerCity>
            <rptOwnerState>NC</rptOwnerState>
            <rptOwnerZipCode>27709</rptOwnerZipCode>
            <rptOwnerStateDescription></rptOwnerStateDescription>
        </reportingOwnerAddress>
        <reportingOwnerRelationship>
            <isOfficer>1</isOfficer>
            <officerTitle>Chief Accounting Officer</officerTitle>
        </reportingOwnerRelationship>
    </reportingOwner>

    <aff10b5One>0</aff10b5One>

    <derivativeTable>
        <derivativeTransaction>
            <securityTitle>
                <value>Restricted Stock Unit</value>
            </securityTitle>
            <conversionOrExercisePrice>
                <footnoteId id="F1"/>
            </conversionOrExercisePrice>
            <transactionDate>
                <value>2024-03-13</value>
            </transactionDate>
            <transactionCoding>
                <transactionFormType>4</transactionFormType>
                <transactionCode>A</transactionCode>
                <equitySwapInvolved>0</equitySwapInvolved>
            </transactionCoding>
            <transactionTimeliness></transactionTimeliness>
            <transactionAmounts>
                <transactionShares>
                    <value>3856</value>
                </transactionShares>
                <transactionPricePerShare>
                    <value>0</value>
                </transactionPricePerShare>
                <transactionAcquiredDisposedCode>
                    <value>A</value>
                </transactionAcquiredDisposedCode>
            </transactionAmounts>
            <exerciseDate>
                <footnoteId id="F2"/>
            </exerciseDate>
            <expirationDate>
                <footnoteId id="F2"/>
            </expirationDate>
            <underlyingSecurity>
                <underlyingSecurityTitle>
                    <value>Common Stock</value>
                </underlyingSecurityTitle>
                <underlyingSecurityShares>
                    <value>3856</value>
                </underlyingSecurityShares>
            </underlyingSecurity>
            <postTransactionAmounts>
                <sharesOwnedFollowingTransaction>
                    <value>18825</value>
                    <footnoteId id="F3"/>
                </sharesOwnedFollowingTransaction>
            </postTransactionAmounts>
            <ownershipNature>
                <directOrIndirectOwnership>
                    <value>D</value>
                </directOrIndirectOwnership>
            </ownershipNature>
        </derivativeTransaction>
    </derivativeTable>

    <footnotes>
        <footnote id="F1">Each Restricted Stock Unit (&quot;RSU&quot;) represents the right to receive, at settlement, one share of Fortrea Holdings Inc. (&quot;Fortrea&quot;) Common Stock. This transaction represents the settlement of RSUs in shares of Common Stock on their scheduled vesting date.</footnote>
        <footnote id="F2">The RSUs vest in three equal annual installments beginning on March 13, 2025.</footnote>
        <footnote id="F3">This number reflects the aggregate number of RSUs held by the reporting person.</footnote>
    </footnotes>

    <ownerSignature>
        <signatureName>/s/ James Stillman Hanson, Attorney-in-Fact for Amanda M. Warren</signatureName>
        <signatureDate>2024-03-15</signatureDate>
    </ownerSignature>
</ownershipDocument>

Solution

  • Thanks to @furas for pointing the error I made, I was using url: "

    https://www.sec.gov/Archives/edgar/data/1965040/000112760224010272"
    

    instead of

    https://www.sec.gov/Archives/edgar/data/1965040/000112760224010272/form4.xml"
    

    which caused the issue. I was able to download the file once I corrected this mistake even with my original headers. Many thanks to you all for the input.