org.jpedal.grouping.PdfGroupingAlgorithms.extractTextAsWordlist()
Extracts the text found inside the given rectangle as a Vector of words, each word followed by its coordinates (x1, y1, x2, y2).
@param x1 is the x coord of the top left corner
@param y1 is the y coord of the top left corner
@param x2 is the x coord of the bottom right corner
@param y2 is the y coord of the bottom right corner
@param page_number is the page you wish to extract from
@param breakFragments whether to divide up text at whitespace characters
@param punctuation is a string containing all characters that should be used to divide up words
@return Vector containing each word found, followed by that word's coordinates (word, x1, y1, x2, y2, ...)
@throws PdfException if the coordinates are not valid
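
The flat layout of the returned Vector (word, x1, y1, x2, y2, repeated) can be awkward to consume directly. The following is a minimal sketch of grouping such a list into one object per word. It does not call JPedal: the class WordListReader, the type Word and the method group are hypothetical names introduced for illustration, and the sample data is made up. It assumes each coordinate arrives as either a String or a Number, so every value is parsed through its string form.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class WordListReader {

    /** One word and its bounding box (hypothetical helper type, not part of JPedal). */
    public static final class Word {
        public final String text;
        public final float x1, y1, x2, y2;

        Word(String text, float x1, float y1, float x2, float y2) {
            this.text = text;
            this.x1 = x1;
            this.y1 = y1;
            this.x2 = x2;
            this.y2 = y2;
        }
    }

    /** Groups a flat (word, x1, y1, x2, y2, ...) list into Word objects. */
    public static List<Word> group(List<?> flat) {
        if (flat.size() % 5 != 0) {
            throw new IllegalArgumentException(
                    "Expected groups of 5 entries, got " + flat.size() + " entries");
        }
        List<Word> words = new ArrayList<>();
        for (int i = 0; i < flat.size(); i += 5) {
            words.add(new Word(
                    String.valueOf(flat.get(i)),
                    toFloat(flat.get(i + 1)),
                    toFloat(flat.get(i + 2)),
                    toFloat(flat.get(i + 3)),
                    toFloat(flat.get(i + 4))));
        }
        return words;
    }

    // Assumption: a coordinate may be a String or a Number, so go through its string form.
    private static float toFloat(Object value) {
        return Float.parseFloat(String.valueOf(value));
    }

    public static void main(String[] args) {
        // Made-up sample data standing in for the Vector returned by extractTextAsWordlist().
        List<Object> flat = Arrays.asList(
                "Hello", "72.0", "720.0", "110.5", "708.0",
                "world", "114.0", "720.0", "150.0", "708.0");
        for (Word w : group(flat)) {
            System.out.println(w.text + " at (" + w.x1 + "," + w.y1 + ")-(" + w.x2 + "," + w.y2 + ")");
        }
    }
}
```

In real use, the Vector returned by extractTextAsWordlist() would be passed to group() in place of the sample list. Rejecting a list whose size is not a multiple of five makes a malformed result fail early instead of silently misaligning words and coordinates.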