Infobat: Mobile Drug Information Search System based on Google Cloud Vision and OCR

29/06/2020 Views : 556

I Ketut Gede Darma Putra

Introduction
The the pharmaceutical world is currently experiencing very rapid development, marked by the discovery of many new drugs and the development of drugs that previously existed. This development is needed to compensate for the many diseases that have begun to mutate into other diseases[1]. However, information about drugs such as indications, contents, and functions of these drugs is still a lot of people do not know it so that when experiencing minor illnesses people are confused about finding the right medicine. This is because media information about the world of drugs is difficult to obtain. One of the ways now to be able to get information about drugs quickly is through surfing in cyberspace. Google as a company that provides the largest web search service products in the world[2], is a fast and appropriate solution to overcome the problems mentioned above. Google has billions of web pages, so users can find the information they want, through the use of keywords and operators. Google has also used Web Search technology in other search services, including, Image Search, Google News, the price comparison site Google Product Search, the interactive Usenet archive Google Groups, Google Maps, and others[3]. Google's newest product that was developed in the last 5 years as a form of following the development of machine learning is Google Cloud Vision. Google Cloud Vision is an image analysis technology that functions to retrieve information from images, especially digital images. Google Cloud Vision can help users to understand the content contained in an image such as facial expressions, written form, and categories of objects in the picture. Google provides the Google Vision API service through the Google Cloud Platform, a platform that can be used by application makers to request the service of various API functions provided by Google [4]–[7].
Google Cloud Vision technology as the basis in this research to create a mobile application that can provide convenience for users to obtain information and recognize drugs through image recognition technology that allows us to remotely process the content of an image and to retrieve its main features[8], This application is named Infobat. Infobat has features including reading text on drug images so that they can recognize the name of the drug which is then searched for information through Google Search, a technology that has the function of getting strings of text photo images such as prints, handwriting and detecting difficult writing such as italics or colored writing. called OCR (Optical Character Recognition)[9][10]. OCR in Google Vision is divided into 2 types. OCR types are Text Detection and Document Text Detection. Infobat uses all of these functions to facilitate the user in the process of recognizing drugs through image media and obtaining various drug information from websites in the Google database.

Overview System
Infobat is an application that is used to provide drug information through writing descriptions on drugs and looking for web pages that have information about the drug being sought. Infobat application uses Google technology that can read the writing of a drug image and search engine technology to display information related to the drug being sought. The results of the search for information obtained in the form of links or web pages that have information about the drug being sought. The Infobat application requires an internet connection to run the functions that are there. A general description of the application of drugs can be seen in Figure 1.




Figure 1 Overview System

In Infobat application, there are 2 functions, namely OCR function and google search function. The OCR function is used to detect writing through drug images which will later be used as input for searching drug information data both through the Google Search function and from the drug database owned by the system. The system has a drug database that contains drug names and descriptions. Users can take drug images via the camera media or gallery function from android as input for the OCR function on the Infobat application. OCR results in the form of writing about drug information from the image that will be used as input values for information search from Google Search functions or drug databases owned by the application. The Google Search function is used to search for links or web pages that have information about drugs that are inputted by users.

Results and Discussion
The results of the design of the Infobat application design resulted in an android application that runs the google functions previously discussed and a database containing drug information in the form of a drug's name and description. Drug database is used to provide drug information in accordance with the name of the drug obtained through the OCR function in the application.
The Infobat application uses the Android Studio engine in the process of creating program code. Making program code using java language for android on Android Studio. Infobat application uses two java activities in making program code, which is MainActivity.java which contains program functions such as Google OCR and DBHelper.java to run the drug database process in the application.

Drug Information Search
Drug Information Lookup is a function to search for drug information based on the detection results of posts from the previous Google OCR function. Drug Information Search has 2 methods, namely through the function of Google Search and Retrieval of Data from the drug database.
This function uses writing data from the results of the Google OCR function to be used as a search value for drug information. The function is run when the user presses the button to search for drug information and the application displays a dialog for the selection of a drug information search method. The first method of searching for drug information that a user can choose is through the Google Search function.
The Google Search function will run when the user chooses a method of searching for drug information using the Google Search function. The first process in the Google Search function is validation in the first TextView. If the TextView does not have a string value to look for then a validation text will appear in the second TextView that there is no writing from the Google OCR results. When it has passed validation, the application will send an API Key, CSE Key that has been provided by Google Custom Search to be able to access the API functions from Google Custom Search. API Key and CSE Key declarations have previously been done in variable declaration functions. After gaining access to the Google custom search function, the function is continued by sending Google Google OCR detection results beforehand to be used as keywords for information search through the media search engine provided by the Google Custom Search API. The results of the Google Search function are data with JSON file types. After getting the results, the application will parse JSON to retrieve the required data from JSON, namely the web page link. The results of the JSON parsing will be displayed on the second TextView in the form of a number of web page links and web descriptions that have information related to the data sought, which is the result of the Google OCR function.

Figure 2 A process of selecting a search method consisting of Google Search and a drug database; b. the process of receiving a picture of a drug and then being given a website link about the information of the drug; c. a website that provides drug information
Figure 2 is the page display when running Google Search functions. This function runs the Google Custom Search function which is a function as a search engine provided by Google. This function is used to search for OCR results from drug images into the search engine functions contained in the application. The results of the Google Search function are web pages that have information about the writing of the drug being searched. Users can directly browse the web page by pressing the link that is displayed as a result of the Google Search function.

Conclusion
Infobat application is an application that is used to find information about a drug. The use of the application is done by inputting a picture of a drug and the application will provide results in the form of drug information data and a link or web page that has information about the image of the drug concerned. The Infobat application uses 2 Google technologies in its application, the Google Cloud Vision API (OCR) and the Google Custom Search API. The Google Cloud Vision API (OCR) technology is used to retrieve writing data from drug images, while the Google Custom Search API technology is used to search for links or web pages that have information related to writing from drug images that have been processed by the OCR system in the application. In addition to using Google Search technology, application users can also search for drug information through the drug database contained in the application.

References
[1]    A. Sharma, A. Jacob, M. Tandon, and D. Kumar, “Orphan drug: Development trends and strategies,” J. Pharm. Bioallied Sci., 2010.
[2]    Google, “Google,” Wikipedia, 2019. [Online]. Available: https://id.wikipedia.org/wiki/Google.
[3]    E. Schonfeld, “Google Processing 20,000 Terabytes A Day, And Growing,” TechCrunch, 2008. [Online]. Available: http://techcrunch.com/2008/01/09/google-processing-20000-terabytes-a-day-and-growing/. [Accessed: 16-Feb-2010].
[4]    G. Developer, “Cloud Vision API Features [Online],” Developer, Google, 2019. [Online]. Available: https://cloud.google.com/vision/docs/other-features.
[5]    G. Developer, “Google Custom Search [Online],” Google Developer, 2019. [Online]. Available: https://developers.google.com/custom-search/. [Accessed: 22-May-2019].
[6]    G. Developer, “Google APIs Explorer [Online],” Google Developer, 2019. [Online]. Available: https://developers.google.com/apis-explorer/#p/. [Accessed: 25-May-2019].
[7]    Google, “Cloud Vision,” Google Cloud, 2018. .
[8]    D. Mulfari, A. Celesti, M. Fazio, M. Villari, and A. Puliafito, “Using Google Cloud Vision in assistive technology scenarios,” in Proceedings - IEEE Symposium on Computers and Communications, 2016, vol. 2016-August, pp. 214–219.
[9]    A. Bissacco, M. Cummins, Y. Netzer, and H. Neven, “PhotoOCR: Reading text in uncontrolled conditions,” in Proceedings of the IEEE International Conference on Computer Vision, 2013, pp. 785–792.
[10]    M. Cutter and R. Manduchi, “Improving the accessibility of mobile OCR apps via interactive modalities,” ACM Trans. Access. Comput., vol. 10, no. 4, 2017.