I am seeking a consultant to extract public information for all Brokers (677,500 records) from the following website:
http://brokercheck.finra.org/Search/Search.aspx
The data shall be delivered as a text file that can be easily imported into MySQL or Access. The data to be extracted is contained in a PDF file for each record. The PDF is reached through the hyperlink titled “View Full PDF Report”, located towards the top right corner of the screen for each record. The PDF is in a structured format. Sample information to be extracted for each Brokerage Firm includes Company Name, CRD#, SEC#, Main Office Location, Mailing Address, Business Telephone Number, Firm Profile Information, Direct Owners & Executive Officers, Registrations, and Types of Business. Similar fields will also need to be extracted for each Broker record, although that PDF is structured differently.
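To make the expected deliverable more concrete, here is a minimal sketch of how the Brokerage Firm fields could be written to a tab-delimited text file. The column names, the `write_firm_records` function, and the sample values are illustrative assumptions, not a required schema. The final column list would follow the actual PDF layout.

```python
import csv
import io

# Hypothetical column names based on the Brokerage Firm fields listed above;
# the real schema would be agreed on once the PDF layout is reviewed.
FIRM_FIELDS = [
    "company_name",
    "crd_number",
    "sec_number",
    "main_office_location",
    "mailing_address",
    "business_telephone",
    "firm_profile",
    "direct_owners_executive_officers",
    "registrations",
    "types_of_business",
]


def write_firm_records(records, out):
    """Write firm records (dicts) to `out` as tab-delimited text with a header row."""
    writer = csv.DictWriter(
        out,
        fieldnames=FIRM_FIELDS,
        delimiter="\t",
        quoting=csv.QUOTE_MINIMAL,  # quote only fields containing tabs, quotes, or newlines
        lineterminator="\n",
    )
    writer.writeheader()
    for record in records:
        # Fields missing from a record are written as empty strings;
        # keys outside FIRM_FIELDS are ignored.
        writer.writerow({name: record.get(name, "") for name in FIRM_FIELDS})


if __name__ == "__main__":
    # Placeholder values only, not real FINRA data.
    sample = [
        {
            "company_name": "Example Securities LLC",
            "crd_number": "000000",
            "main_office_location": "123 Example St, Anytown, NY",
            "types_of_business": "Type A; Type B",
        }
    ]
    buffer = io.StringIO()
    write_firm_records(sample, buffer)
    print(buffer.getvalue())
```

A file in this shape can typically be loaded into MySQL with `LOAD DATA INFILE ... FIELDS TERMINATED BY '\t' OPTIONALLY ENCLOSED BY '"' IGNORE 1 LINES`, or into Access through its text import wizard. Multi-valued fields such as Registrations are joined with a separator in this sketch, but could be delivered as separate files if that is preferred.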
The site is protected by a CAPTCHA.
