Beruflich Dokumente
Kultur Dokumente
Image Hunter
Image Hunter
1.3 Introduction
1.3.1 Introduction:
The variety of environmental data (text, diagrams, images, videos, and other types of measurements), together with the fact that they are stored in worldwide distributed databases, creates new challenges when attempting to retrieve relevant information. Furthermore, this information is usually attached to time and space.This project focuses on using both LOCAL and GLOBAL features of the image to retrieve relevant information.Also to detect piracy by its application
Image Hunter
1.3.3.1 New feature extraction method that simultaneously captures the global and local characteristics of an image by adaptively computing hierarchical geometric centroids of the image. 1.3.3.2Since relevant information is retrieved from DB searches the searching time is saved tremendously. 1.3.3.3 Ranked results provide clear idea about the gap between the contentsearched and the searched result(Currently GOOGLE search engine uses the PAGE RANK ALGORITHM, but the semantic gap between the images cannot be found out) 1.3.3.4To solve issues associated with copy-right infringement and content pirating through Near Duplicate Image Detection 1.3.3.5 This can be applied in forensic sciences in which it is possible to detect a culprit based on the content in live footage. 1.3.3.6 Finds its application in semantic image analysis in medical diagnosis
Image Hunter
2. System Study
2.1 Existing System (Advantages and Disadvantages of existing System):
The system to be designed can be thought of as an alternate to existing system and also a new self contained product. Content based image retrieval is the alternate for the tag based searching. Near duplicate search for searching morphed images is new self contained product for identifying the morphed images that can help in stopping piracy and copyright infringement.
Image Hunter
2.5 Constraints:
2.5.1 Images are sensitive to illumination as there features change due to illumination. 2.5.2Features of the image, both LOCAL and GLOBAL are having their own weaknesses. 2.5.3The results obtained are approximate not exact. 2.5.4These systems usually fail when the contrast of image is changed.
Image Hunter
Image Hunter
3.1.4 References:
[2] Near-Duplicate Image Recognition and Content-based Image Retrieval using Adaptive Hierarchical Geometric Centroids Mai Yang, Guoping Qiu, Jiwu Huang and Dave Elliman, The 18th International Conference on Pattern Recognition 2006 [3]Image Retrieval: Ideas, Influences, and Trends of the New Age RITENDRA DATTA, DHIRAJ JOSHI, JIA LI, and JAMES Z. WANG [4]Image Retrieval using Shape Feature S.Arivazhagan, L.Ganesan, S.Selvanidhyananthan [5]Region Based Image Similarity Search - Prof. Dr. H.-J. Schek
Image Hunter
Image Hunter
Image Hunter
3.3.1.6 User shall be able to see the ranked result: Once the database is queried with respect to a image in both types of searching, the results are displayed to the users based on the ranking of the image. More the distance lower is the ranking.
Image Hunter
Performance
Performance is measured by recall and precision values which will not be less than 85%.
Security
There are 2 users for the system. General user is unauthenticated as there is no privilege to alter the database. As admin populates the database, Admin is always authenticated.
Availability
System is available through on a single system on which it is installed, provided there is also the availability of the database system.
Usability
This system finds it use in medical applications, forensic science and the search engines.
Interoperability
This is independent software. This can be made to operate with the database of similar applications.
Image Hunter
The document can be used by project guide to guide about the progress of the project and also provide guidelines about the suitable changes to be made in later part of project development.
Peers can use this document to test the project for the requirements stated. Suggest any improvements.
Image Hunter
This project mainly focuses on the software part of CBIR and NDIR. Hence the hardware interfaces is not applicable. In future, this can be extended to the hardware domain where in the search result appears on the live stream for a video camera.
Image Hunter
Test Id 1
browsing database should be for the feature set of the image. And the admin given upload should chance one be to more enters name the should the system
image. Admin logs into the When user system alphanumeric password system using correct and system Else, user
password,
continue its execution. should halt there till the correct user name and 3 User password is entered searches On
browsing an image interface for attributes should appear on the screen. Here the user is permitted allowed to make between NDIR. 3.1 User chooses CBIR Then the ranked a choice and CBIR
result set will appear as per the content 3.2 User chooses NDIR based search. Then the ranked 15
Image Hunter
result set will appear as 4 per the near duplicate search. User chooses the In this case user attributes searching. for specifies the attributes like rank range and distance range. The result of searching will 5 follow the attributes. User chooses the As the algorithm can tuning factor. be tuned for the involvement of colour and structure, users are allowed to tune between the two. That means, between a these point two
values can be taken. 6 User sees This refines the result. the The searching should happen semantics conceptually on the screen. as and per the ranked
ranked result.
16
Image Hunter
17
Image Hunter
4.1Introduction
4.1.1 Summary:
4.1.1.1 Purpose of project The intended system shall be able to identify the Content related images in the database. Here the retrieved images should be content related to the query image. Also, the intended system shall be able to find out the Near duplicate set of images from a database with reference to a query image. 4.1.1.2 Scope of Project The system to be designed searches for the image based on semantics of the queried image and the results are ranked based on the conceptual distance with queried image. In case of the near duplicate search, the morphed images are identified. The results are then ranked with conceptual distance. This ranking can be alternative to the traditional page ranking of image (done on the number of visits). The advantage is that, the rank will never change until the content of image changes (Unlike, page rank where ranking changes in short intervals). 4.1.1.3 Intended audience This a project about a college level, the intended audience are as follows. i.Project Guide ii.Evaluation committee iii.Peers i.Project Guide can use this document for monitoring the progress in the development of the project and set guidelines according to it. ii.Evaluation Committee can use this document to map the requirements quoted and the detailed design document. Also check for the completeness and unambiguous requirements. iii.Peers can use this document this to suggest any changes needed and give reviews on the project.
4.1.2 Terminology
Some of the terminologies used in the document are Feature: Feature vector that describes the feature set of the image. 18
Image Hunter
Distance: This refers to the Canberra distance between two vectors (image vectors) Tuning factor: Refers to the in the distance formula for the CBIR distance based ranking.
Non Goals: Provide interface to other applications which need service of the project. (i.e if the system is supposed to be a subsystem of larger subsystem, the interface is not provided in the current scope)
Image Hunter
Presentation logic: The processing (instructions, routines, etc.) required to display or print data. It typically refers to the execution of the user interface (GUI). Business logic: describe the functional algorithms that handle information exchange between a database and a user interface Data access logic: Which provides simplified access to data stored in persistent storage of some kind, such as an entity-relational database?
The block diagrams for both the intended users are as follows. 20
Image Hunter
4.2.1.1 ADMIN:
4.2.1.2 USER:
ABSTRACT SPECIFICATION: 21
Image Hunter
User: i) Query Image: Browse an image from Local Hard Drive. And query the image. ii) Extract Features: Feature set is extracted one at a time from database. Feature retrieved from DB is matched with features Canberra distance Sort the distance in ascending order and Display Result as per sorted distance. Compute colour at all 63 centroids. iii) Store in database: Store the features in individual column along with the path with where image is stored. of image uploaded using
Image Hunter
SEARCH_OBJECT Distance Compute_distance_CBIR () Compute_distance_NDIR () To hold Image location and Rank a simple structure can be used as below: struct image_distance { Image; Distance; } 4.3.2 Use case diagrams: 23
Image Hunter
Uploads an image
Feature Extraction
DB
ADMIN
Use case: Admin uploads an image Success Scenario: The image gets stored in a location and the database is populated with the feature set of the image.
Uploads an image
Feature Extraction
DB
USER
Results
Use case: User searches the database Success Scenario: The ordered set of results is displayed in the database. 24
Image Hunter
25
Image Hunter
2.User
26
Image Hunter
27
Image Hunter
Component Design: Feature of Extraction can be divided into two parts Centroid features: Hierarchical centroid. Here the hierarchy of centroids is computed first.
28
Image Hunter
Colour features: Colour at centroids. In this subsystem only the colour at given centroid is computed.
iv. Find the distance with respect to all images in database. v. Sort all the distances in ascending order. The one with least distance is the most near duplicate image vi The results can be ranked on distance
4.5.2.2 i. ii.
Feature vector also includes colour of all 63 points. Thus now feature vector is now containing 126 + 63 = 169 features. 29
Image Hunter
iii. iv. v.
Feature vector also includes colour of all 63 points. Thus now feature vector is now containing 126 + 63 = 169 features. Then extract features. Find the distance using
vi.
Set the value of , which is the tuning factor that decides upon whether a user prefers structure or colour
4.6 Logging
Not applicable to this project.
4.7 Exceptions
System should warn when the image type that is uploaded is not of relevant type. Warning should be thrown when the image uploaded is more than some threshold value.
4.8 Localization
This project can find its localization in Medical diagnosis. Forensic science. Search engines etc
This system can be built on existing computing architectures. 4.9 Dependencies The algorithm proposed for designing this system is Adaptive Geometric Centroid Algorithm which works on the concept of centroids of matrices. The algorithm assumes the following i. Centroids are invariant to scaling of image. ii. Centroids are also invariant to the illumination of the image. Since the system to be designed is an independent entity, there are no dependencies.
30
Image Hunter
The deployment of the system is done on the single machine on which both application and the database runs.
31
Image Hunter
5. References
[1] http://www.agilemodeling.com/artifacts/deploymentDiagram.htm [2] Near-Duplicate Image Recognition and Content-based Image Retrieval using Adaptive Hierarchical Geometric Centroids Mai Yang, Guoping Qiu, Jiwu Huang and Dave Elliman, The 18th International Conference on Pattern Recognition 2006 [3]Image Retrieval: Ideas, Influences, and Trends of the New Age RITENDRA DATTA, DHIRAJ JOSHI, JIA LI, and JAMES Z. WANG [4]Image Retrieval using Shape Feature S.Arivazhagan, L.Ganesan, S.Selvanidhyananthan [5]Region Based Image Similarity Search - Prof. Dr. H.-J. Schek
32
Image Hunter
6. Appendix
Continuous work is going on with respect to Content Based Image Search and the near duplicate searching. One beta version of search engine that operates on the visual query is available at http://www.tiltomo.com/ Google is also working on CBIR and its first iterate (prototype) is available at http://similar-images.googlelabs.com/
33