A common request from our clients is the ability to search only specific sections of 10-K filings. The sections they most often want to index are the Risk Factors (Item 1A) and the MD&A (Item 7). With our release of 126.96.36.199 this is now possible. There are two steps: download the item sections to your local computer, then run the indexing software built into the Search Extraction & Normalization Engine.
The process to download the various Items from the 10-K to your local computer is described in the Help file for the Search Extraction & Normalization Engine. Briefly, you need a CSV file that lists the CIKs, the years and the period focus (PF). I am going to cheat in this example and use the list of most recently filed 10-Ks to generate my request file.
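For readers who would rather build the request file with a script, here is a minimal sketch in Python. The column headers (CIK, YEAR, PF) and the PF value "FY" are assumptions made for illustration only; the Help file is the authority on the exact layout the application expects.

```python
import csv
import io

# Hypothetical column names; check the Help file for the real layout.
FIELDNAMES = ["CIK", "YEAR", "PF"]

def build_request_rows(ciks, years, period_focus="FY"):
    """Return one row per CIK-year pair. "FY" is an assumed PF value."""
    return [
        {"CIK": cik, "YEAR": year, "PF": period_focus}
        for cik in ciks
        for year in years
    ]

def write_request_csv(rows, handle):
    """Write the rows, with a header line, to any open text handle."""
    writer = csv.DictWriter(handle, fieldnames=FIELDNAMES)
    writer.writeheader()
    writer.writerows(rows)

# Monsanto (CIK 1110783) for 2016, written to an in-memory buffer.
rows = build_request_rows([1110783], [2016])
buffer = io.StringIO()
write_request_csv(rows, buffer)
print(buffer.getvalue())
```

To write to disk instead, pass a file opened with open("request.csv", "w", newline="") in place of the buffer.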
You can see my request file in the image above. However, to show off a bit I also included a screenshot of our internal application that tracks the handling of each filing. The image above is a bit fuzzy; the first CIK in the list is Monsanto (1110783). If you go to EDGAR and check the filing date and time of their 2016 10-K you will see that it was filed on 10/19/2016 at 3:31 CT (4:31 ET). The details in our log above tell me that it was pushed to our server at 3:53 CT (4:53 ET). So this Risk Factors section was available to our clients 22 minutes after the filing was made on EDGAR. Pretty amazing.
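The lag between filing and availability is simple clock arithmetic. The sketch below only restates the two times quoted above; the helper name is made up for illustration, and it assumes both times fall in the same half of the same day.

```python
from datetime import datetime

def minutes_between(start_hhmm, end_hhmm):
    """Minutes from start to end, both given as H:MM on the same day."""
    fmt = "%H:%M"
    delta = datetime.strptime(end_hhmm, fmt) - datetime.strptime(start_hhmm, fmt)
    return delta.total_seconds() / 60

# Filed on EDGAR at 3:31 CT, pushed to our server at 3:53 CT.
lag = minutes_between("3:31", "3:53")
print(lag)  # 22.0
```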
Once the request file has been created, use the controls in the application interface to query our server and pull the actual Risk Factors sections for your list of CIK-YEAR pairs.
The process runs very fast. Typically we can push more than 500 files a minute to your local computer, although time of day, our network load and your network load all affect the download speed. When the process is complete, all the Risk Factors sections for your CIK-YEAR pairs are in the directory you specified. They are named using our standard notation: the CIK, the date the filing was made available on EDGAR and, for these particular filings, the balance sheet date. Each of these files can be viewed individually using our SmartBrowser.
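To get a rough feel for how long a large request might take, you can divide the number of files by the typical rate. This is only a back-of-the-envelope sketch: the function name is invented for illustration, and 500 files a minute is the typical figure quoted above, not a guarantee.

```python
import math

def estimated_minutes(file_count, files_per_minute=500):
    """Rough download time in whole minutes at an assumed steady rate."""
    if files_per_minute <= 0:
        raise ValueError("files_per_minute must be positive")
    return math.ceil(file_count / files_per_minute)

# A request of 4,200 CIK-YEAR pairs at the typical rate.
print(estimated_minutes(4200))  # 9
```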
As you can see, we have controls that allow you to cycle through these quickly. However, if you want the full range of directEDGAR features for these files, they need to be indexed. The indexing process requires that you select a collection of files and specify a destination for the files. The application will handle all of the intermediate steps.
When the indexing is complete the application automatically adds the index to a directEDGAR_CUSTOM index library and that library is then selected as the active library. So it is only necessary to select the index and begin searching.
The search results will have the hit highlighting and hit-to-hit navigation features. You can extract context around search results, extract tables, get word counts – in short all of the features associated with the primary indexes are made available on these custom indexes. And while I ran a simple search above, you can run as complex a search as you need because you have the full power of our search engine managing the background processes.
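If it helps to picture what a word count and context extraction produce, here is a toy sketch in plain Python. It is not how our search engine works internally; the function names, the sample text and the 40-character window are all assumptions chosen for illustration.

```python
import re

def word_count(text):
    """Count word tokens in a block of text."""
    return len(re.findall(r"\b\w+\b", text))

def context_around(text, term, width=40):
    """Return a snippet of up to `width` characters on each side of every hit."""
    snippets = []
    for match in re.finditer(re.escape(term), text, flags=re.IGNORECASE):
        start = max(0, match.start() - width)
        end = min(len(text), match.end() + width)
        snippets.append(text[start:end])
    return snippets

# Made-up sample text standing in for a downloaded Risk Factors file.
sample = (
    "Our business is subject to cybersecurity risk. "
    "A cybersecurity breach could harm results."
)
print(word_count(sample))
for snippet in context_around(sample, "cybersecurity"):
    print(snippet)
```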