ERIC Number: EJ1143599
Record Type: Journal
Publication Date: 2017
Pages: 7
Abstractor: As Provided
ISBN: N/A
ISSN: ISSN-0883-2323
EISSN: N/A
Scraping EDGAR with Python
Ashraf, Rasha
Journal of Education for Business, v92 n4 p179-185 2017
This article presents Python codes that can be used to extract data from Securities and Exchange Commission (SEC) filings. The Python program web crawls to obtain URL paths for company filings of required reports, such as Form 10-K. The program then performs a textual analysis and counts the number of occurrences of words in the filing that reflect, for example, uncertainty (or any other quality specified by the researcher). The program can be easily modified to conduct other searches by changing the word list, company names, or SEC filings. The Python program could be used in an introductory graduate data analytics course in finance that has a web crawling or textual analysis component.
Descriptors: Information Retrieval, Search Engines, Search Strategies, Online Searching, Electronic Libraries, Business Administration Education, Graduate Study, Data Analysis, Data Processing, Educational Technology, Technology Uses in Education
Routledge. Available from: Taylor & Francis, Ltd. 530 Walnut Street Suite 850, Philadelphia, PA 19106. Tel: 800-354-1420; Tel: 215-625-8900; Fax: 215-207-0050; Web site: http://www.tandf.co.uk/journals
Publication Type: Journal Articles; Reports - Descriptive
Education Level: Higher Education; Postsecondary Education
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A