Abstract: The invention provides a system for automatic generation of robust and searchable content Comprising a platform (recording device) to create automatically and rapidly content related to businesses in a form that can be used to search for information using hand held or any other devices including mobile phones.
FORM-2 THE PATENTS ACT, 1970
(39 of 1970)
& THE PATENTS RULES, 2003
PROVISIONAL
Specification
(See Section 10 and rule 13)
SEARCHING OF DIGITAL DATA
TATA CONSULTANCY SERVICES LTD.,
an Indian Company of Bombay House, 24, Sir Homi Mody Street, Mumbai 400 001 Maharashtra,
India
THE FOLLOWING SPECIFICATION DESCRIBES THE INVENTION
This invention relates to the field of searching of digital data.
Background
Availability and correctness of searchable content is fast growing especially with the penetration of Internet in general and mobiles in particular. Most of the content these days is user generated content (UGC) but still a large amount of content Is generated manually especially when its accuracy and relevant requirement is high. People go onto the field and feed in data into an electronic system; making the task of content generation both human intensive and erroneous. This not only adds to the cost of content creation but also takes longer time to create usable and searchable content.
Objectives
The main object of this invention is to provide a mechanism for automatic creation of content by building a content creation device which captures information from multiple sources in time sync and automatically generates content. The invention facilitates creation of content using multiple cues which include
• GPS Recoding Module - Captures the latitude and longitude of the current location along with the time stamp.
• Speech/Audio Recoding Module - a speech recording system which records audio of the speaker along with a time stamp
• lmage(Video Capturing Module - a video capturing module which captures the video with time stamp.
In accordance with this invention therefore there is provided
1. A system for automatic generation of robust and searchable content Comprising a platform (recording device) to create automatically and rapidly content related to businesses in a form that can be used to search for information using hand held or any other devices including mobile phones.
3
In accordance with a preferred embodiment of the invention, the platform can provide searchable information about businesses like name of the business, their location and any other information that relates to a company or business.
Still further, the system can extract information from multiple data sources including images, speech, position locaters
Typically, the system can be configured to work on different languages (Telugu, Tamil etc) using the support of a language tool or an electronic dictionary
The invention envisages a portable device which houses a GPS module, a sound recording module and a video recording module with the additionally facility to append time stamp on the input information stream.
Also envisaged is a mechanism to play video, audio and display the location by using GPS provided latitude and longitude information for purpose of reviewing captured content.
Particularly, the video analysis module is capable of identifying text images, extraction of image text (in different fonts and styles and different languages) into electronic text.
Particularly, the speech recognition module can convert recorded audio into electronic textual information (speech to text)
Particularly, the latitude, longitude to name of place converting mechanism uses existing latitude, longitude to name of place lookup database.
Also envisaged in the system is a mechanism to combine the outputs from video analysis, speech recognition and processed latitude, longitude information and uploading into a structured database.
4
The invention will now be described with reference to the accompanying drawings, in which
Figure 1 illustrates the block diagram of the solution architecture of the system of this invention; and
Figure 2 shows the block diagram of high level content extraction in the architecture of Figure 1.
Referring to the drawings, Figure 1 gives the high level view of the invention that allows recording of content. All the recording modules sit on a battery operated recoding device (which in a specific case could be a laptop) and the microphone, the video camera and the GSM module are connected to the data capturing device through a port (typically serial, USB, fast USB but not restricted to these modalities). The audio, video and GSM recording modules are assisted by time stamping module such that the different recordings modes have the same time stamp. All the information recorded by the devices are buffered and then stored on a storage device which could be connected externally (or connected internally) to the recording device.
Figure 2 illustrates the high-level architectural view of the content extraction methodology. All the stored information is first processed to extract the time stamp and this is appended to the information extracted from the audio, video and location processing modules. The audio processing module converts the recorded speech into electronic text; the video processing module uses video processing algorithms first identifies the regions having text and further processes to identify the language and then further processes to extract electronic text from the text image. The location is identified by using the time stamped latitude, longitude and identifies the location using latitude, longitude to location mapping database. The electronic texts
5
extracted from different modalities are synchronized using the time stamp and uploaded into a database.
This invention therefore provides a system enabling the creation of content (like business address and likes of it) automatically and robustly by fusing information from derived from multiple data sources. Further the present invention provides a platform that enables fast and automatic method of generating content to make it searchable from a mobile or any other device and a correlation between the content and its spatial location.
The Key Benefits and Features of the invention are
• An efficient and fast methodology of creating content. It beats the conventional way of manually acquiring data (time consuming) or through user generated content (error prone)
• Robust because information from different sources in evaluated separately and concatenated together; the time stamp gives authenticity to the electronic text extracted from different sources.
• The recoding device can be mounted inside a vehicle which can record (video) the businesses along the driven road and the person recording speaks the business name and or any information this is relevant and the latitude and longitude is recorded (GSM recording device). This enables a fast methodology of capturing information.
• An automated method can be employed to extract electronic text from different sources of information using video and speech processing algorithms. Making this method of extraction less prone to human errors.
While considerable emphasis has been placed herein on the specific structure of the preferred embodiment, it will be appreciated that many alterations can be made and that many modifications can be made in the preferred embodiment without departing from the principles of the invention. These and other changes in the preferred embodiment as well as other embodiments of the invention will be apparent to those
6
skilled in the art from the disclosure herein, whereby it is to be distinctly understood that the foregoing descriptive matter is to be interpreted merely as illustrative of the invention and not as a limitation.
ith
Dated this 19th day of August, 2008.
7
| Section | Controller | Decision Date |
|---|---|---|
| # | Name | Date |
|---|---|---|
| 1 | 1751-MUM-2008-CORRESPONDENCE(IPO)-(FER)-(27-11-2015).pdf | 2015-11-27 |
| 1 | 1751-MUM-2008-RELEVANT DOCUMENTS [28-09-2023(online)].pdf | 2023-09-28 |
| 2 | 1751-MUM-2008-RELEVANT DOCUMENTS [26-09-2022(online)].pdf | 2022-09-26 |
| 2 | Other Document [16-08-2016(online)].pdf_89.pdf | 2016-08-16 |
| 3 | Other Document [16-08-2016(online)].pdf | 2016-08-16 |
| 3 | 1751-MUM-2008-RELEVANT DOCUMENTS [30-09-2021(online)].pdf | 2021-09-30 |
| 4 | Form 13 [16-08-2016(online)].pdf | 2016-08-16 |
| 4 | 1751-MUM-2008-IntimationOfGrant27-02-2020.pdf | 2020-02-27 |
| 5 | Examination Report Reply Recieved [16-08-2016(online)].pdf | 2016-08-16 |
| 5 | 1751-MUM-2008-PatentCertificate27-02-2020.pdf | 2020-02-27 |
| 6 | Description(Complete) [16-08-2016(online)].pdf_88.pdf | 2016-08-16 |
| 6 | 1751-MUM-2008-Written submissions and relevant documents [06-02-2020(online)].pdf | 2020-02-06 |
| 7 | Description(Complete) [16-08-2016(online)].pdf | 2016-08-16 |
| 7 | 1751-MUM-2008-ORIGINAL UR 6(1A) FORM 26-290120.pdf | 2020-01-30 |
| 8 | Correspondence [16-08-2016(online)].pdf | 2016-08-16 |
| 8 | 1751-MUM-2008-FORM-26 [24-01-2020(online)].pdf | 2020-01-24 |
| 9 | 1751-MUM-2008-HearingNoticeLetter-(DateOfHearing-27-01-2020).pdf | 2019-12-31 |
| 9 | Claims [16-08-2016(online)].pdf | 2016-08-16 |
| 10 | 1751-mum-2008-abstract(17-8-2009).pdf | 2018-08-09 |
| 10 | Abstract [16-08-2016(online)].pdf | 2016-08-16 |
| 11 | 1751-mum-2008-claims(17-8-2009).pdf | 2018-08-09 |
| 11 | RTOA-1751MUM2008-rev.pdf | 2018-08-09 |
| 12 | 1751-mum-2008-correspondence(17-8-2009).pdf | 2018-08-09 |
| 12 | POA-TCS.pdf | 2018-08-09 |
| 13 | 1751-MUM-2008-CORRESPONDENCE(25-9-2008).pdf | 2018-08-09 |
| 13 | CLAIMS-Mark+Clean.pdf | 2018-08-09 |
| 14 | 1751-MUM-2008-CORRESPONDENCE(4-11-2010).pdf | 2018-08-09 |
| 14 | abstract1.jpg | 2018-08-09 |
| 15 | 1751-mum-2008-correspondence.pdf | 2018-08-09 |
| 15 | Abstract-1751.pdf | 2018-08-09 |
| 16 | 1751-mum-2008-description(complete)-(17-8-2009).pdf | 2018-08-09 |
| 16 | 1751MUM2008-spec-reformat.pdf | 2018-08-09 |
| 17 | 1751-MUM-2008_EXAMREPORT.pdf | 2018-08-09 |
| 18 | 1751-mum-2008-form 5(17-8-2009).pdf | 2018-08-09 |
| 18 | 1751-mum-2008-description(provisional).pdf | 2018-08-09 |
| 19 | 1751-mum-2008-drawing(17-8-2009).pdf | 2018-08-09 |
| 19 | 1751-mum-2008-form 3.pdf | 2018-08-09 |
| 20 | 1751-mum-2008-drawing.pdf | 2018-08-09 |
| 20 | 1751-mum-2008-form 2.pdf | 2018-08-09 |
| 21 | 1751-MUM-2008-FORM 1(25-9-2008).pdf | 2018-08-09 |
| 22 | 1751-mum-2008-form 1.pdf | 2018-08-09 |
| 22 | 1751-mum-2008-form 2(title page).pdf | 2018-08-09 |
| 23 | 1751-mum-2008-FORM 1.pdf_1.pdf | 2018-08-09 |
| 23 | 1751-mum-2008-form 2(title page)-(17-8-2009).pdf | 2018-08-09 |
| 24 | 1751-mum-2008-form 2(17-8-2009).pdf | 2018-08-09 |
| 24 | 1751-mum-2008-form 13(17-8-2009).pdf | 2018-08-09 |
| 25 | 1751-MUM-2008-FORM 18(4-11-2010).pdf | 2018-08-09 |
| 26 | 1751-mum-2008-form 13(17-8-2009).pdf | 2018-08-09 |
| 26 | 1751-mum-2008-form 2(17-8-2009).pdf | 2018-08-09 |
| 27 | 1751-mum-2008-FORM 1.pdf_1.pdf | 2018-08-09 |
| 27 | 1751-mum-2008-form 2(title page)-(17-8-2009).pdf | 2018-08-09 |
| 28 | 1751-mum-2008-form 1.pdf | 2018-08-09 |
| 28 | 1751-mum-2008-form 2(title page).pdf | 2018-08-09 |
| 29 | 1751-MUM-2008-FORM 1(25-9-2008).pdf | 2018-08-09 |
| 30 | 1751-mum-2008-drawing.pdf | 2018-08-09 |
| 30 | 1751-mum-2008-form 2.pdf | 2018-08-09 |
| 31 | 1751-mum-2008-drawing(17-8-2009).pdf | 2018-08-09 |
| 31 | 1751-mum-2008-form 3.pdf | 2018-08-09 |
| 32 | 1751-mum-2008-description(provisional).pdf | 2018-08-09 |
| 32 | 1751-mum-2008-form 5(17-8-2009).pdf | 2018-08-09 |
| 33 | 1751-MUM-2008_EXAMREPORT.pdf | 2018-08-09 |
| 34 | 1751-mum-2008-description(complete)-(17-8-2009).pdf | 2018-08-09 |
| 34 | 1751MUM2008-spec-reformat.pdf | 2018-08-09 |
| 35 | 1751-mum-2008-correspondence.pdf | 2018-08-09 |
| 35 | Abstract-1751.pdf | 2018-08-09 |
| 36 | 1751-MUM-2008-CORRESPONDENCE(4-11-2010).pdf | 2018-08-09 |
| 36 | abstract1.jpg | 2018-08-09 |
| 37 | 1751-MUM-2008-CORRESPONDENCE(25-9-2008).pdf | 2018-08-09 |
| 37 | CLAIMS-Mark+Clean.pdf | 2018-08-09 |
| 38 | POA-TCS.pdf | 2018-08-09 |
| 38 | 1751-mum-2008-correspondence(17-8-2009).pdf | 2018-08-09 |
| 39 | 1751-mum-2008-claims(17-8-2009).pdf | 2018-08-09 |
| 39 | RTOA-1751MUM2008-rev.pdf | 2018-08-09 |
| 40 | 1751-mum-2008-abstract(17-8-2009).pdf | 2018-08-09 |
| 40 | Abstract [16-08-2016(online)].pdf | 2016-08-16 |
| 41 | 1751-MUM-2008-HearingNoticeLetter-(DateOfHearing-27-01-2020).pdf | 2019-12-31 |
| 41 | Claims [16-08-2016(online)].pdf | 2016-08-16 |
| 42 | 1751-MUM-2008-FORM-26 [24-01-2020(online)].pdf | 2020-01-24 |
| 42 | Correspondence [16-08-2016(online)].pdf | 2016-08-16 |
| 43 | 1751-MUM-2008-ORIGINAL UR 6(1A) FORM 26-290120.pdf | 2020-01-30 |
| 43 | Description(Complete) [16-08-2016(online)].pdf | 2016-08-16 |
| 44 | Description(Complete) [16-08-2016(online)].pdf_88.pdf | 2016-08-16 |
| 44 | 1751-MUM-2008-Written submissions and relevant documents [06-02-2020(online)].pdf | 2020-02-06 |
| 45 | Examination Report Reply Recieved [16-08-2016(online)].pdf | 2016-08-16 |
| 45 | 1751-MUM-2008-PatentCertificate27-02-2020.pdf | 2020-02-27 |
| 46 | Form 13 [16-08-2016(online)].pdf | 2016-08-16 |
| 46 | 1751-MUM-2008-IntimationOfGrant27-02-2020.pdf | 2020-02-27 |
| 47 | Other Document [16-08-2016(online)].pdf | 2016-08-16 |
| 47 | 1751-MUM-2008-RELEVANT DOCUMENTS [30-09-2021(online)].pdf | 2021-09-30 |
| 48 | Other Document [16-08-2016(online)].pdf_89.pdf | 2016-08-16 |
| 48 | 1751-MUM-2008-RELEVANT DOCUMENTS [26-09-2022(online)].pdf | 2022-09-26 |
| 49 | 1751-MUM-2008-CORRESPONDENCE(IPO)-(FER)-(27-11-2015).pdf | 2015-11-27 |
| 49 | 1751-MUM-2008-RELEVANT DOCUMENTS [28-09-2023(online)].pdf | 2023-09-28 |