data analysis that take an input of files and display desired output via the command line