Section 4. Information technology
Section 4. Information technology
Muminov Bahodir.Boltayeich, Ph. D., Senior Researcher, Tashkent University of Information Technology Tashkent, Uzbekistan.
E-mail: [email protected]
The calculating rating of electronic resources
Abstract: The rating of electron resources is devoted to count by theories, directions in this work. The calculating model of rating of ER by entering and exiting directions on bases of used widely PageRank is produced for calculating the rating of web pages in Google searching system. The rating of ER is taken into account for calculating the ratings of entering direction and the calculating exiting direction is accomplished by equitable distribution of ER. And also the calculating rating ER among kinds fields by entering distribution rating term is given for calculating rating of ER. And four definitions re included in it for calculating ratings. The calculating of ER rating is very important for searching information and submission of it.
Keywords: Electron resource, calculating excerption, calculating rating, PageRank algorithm, searching information, bestowing information, direction, graph, electron resource center, information.
I. Introduction
The causes of volumes and numbers of electron resources' proliferation is increasing two options of information copies in last years. One of popular and widespread methods of marking the quality of information is the rating of bringing excerption method [1]. The list of books which given excerption in scientific works or hyper direction among electron resources (ER) (web pages) may be an example to excerptions. The bringing excerption rating idea is consist of finding its quality mark by the volumes and qualities of ER.
II. Main part
The lists ofused literatures in ER and the analogy among directions of certain pages in web pages may be actualized. Calculating ER directions from different resources gives the relevance of ER, importance or the approximate values of quality. PageRank algorithm counts the number of directions, regulates ER by the number of directions in ER on bases of equality of ER directions' importance and also expands it. The rating of ER by PageRank is found the following condition [3]:
Let's imagine, Tl....Tn ERs are linked in A ER. Parameter d belongs to (0,1) space, and also it is diminishing coefficient. Usually, d = 0,8. Coefficient d is used for restricting the number of accessions. C(T) function marks the number of exiting direction from T ER. In this situation, the rating of A ER by PageRank is counted by the following formula:
In the calculating PR(A) (the rating ofA ER by PageRank) we can see that Tv..Tn rating of ER PR(Tn) are taken into account by PageRank. And so, the rating of ER which linked it is counted in determining ER rate, the rate of ER is connected to the quality of ER which linked it. It must be make a note, PageRank marks distribution ofpossibilities of all ER PageRank rates sum are equal to one for every ER.
The rate of PageRank PR(A) may be counted by using simple and many values algorithms and normalized directions is correspond to their own vectors. It must be make a note, the rate of Pag-
eRank for ten billion web pages may be counted in several hours in Server which has average strength [3].
This method of calculating ER rate is used low for graduating published works and authors in scientific field. The famous defect of this method is using equal weight denotation for all directions. If we say it differently, the author's direction which has many directions from other resources are compared to the directions whose has not directions from other resources. Outside of it, this marking isn't adequate in web field, because the main duty of this method is to count the huge number of quality directions which enter ER simply.
The searching problem of marking method of direction's quality which working in as web field is settled successfully by invented PageRank algorithm. This algorithm was invented by two investigators Sergey Brin and Lorens Peydj of Stanford university, then it works as the part of Google searching system's (www.google.com) technological base.
We think about using algorithms which is such as PageRank algorithms for calculating ER rate of Electron Resource center (ERC) is expedient.
Block quotes in ERC doesn't count lodging directions in ER, its format and the others in not having some attributes, only it may imagine the structure of direction as graph in calculating to direct from one ER to other one ER.
Let's imagine, ER are given one by one in databases (DB) and it consist of n ERs which have DocID identification (the idendificator number ofER in DB) belong to V =[l,n] distance.
Definition-1. The ordered couple (i,j)e V2 is called block quote or direction. i is exiting direction ofER and j is entering direction of ER. E selection is formed by all direction among ER in V selection, it is directed at G = (V, E) direction graph and these graphs are called the graph of directions.
Definition-2. It is G=(V, E), in here V- the final selection of graph's summit, EczV-V and i eV . The selection of entering direction is marked with I(i) and the selection of exiting direction is marked with O(i), and that is:
I (i ) = {e e E\e = (j ,i), j eV }
The calculating rating of electronic resources
O(i) = {e e EI e = (i, j), j eV} Definition-3. If there is no (i)or I(i), it is marked with {0}, ER rate is 0 for i e V .
Definition-4. Any ER doesn't give its entering direction I (i ) and exiting direction O (i ).
The rate of entering direction Iratjns is counted by the following formula:
I1 DocID |S count (I (i))I()
(2)
In here, count (I (i)) — is the number of all entering directions. The rate of exiting direction Oratjng is counted by the following formula:
_ \O(i)DocID\
O,
To
O(i)
(3)
d count (O (i))
In here, count (O(i)) — is the number of all exiting directions. The rate of ER Docr is counted by the following formula:
'DocID ' ®
( + )
_ \ 'DocID_'DocID /
~ 2
Doc
(4)
Table 1. - The calculating of direction rate
DocID The entering directions The exiting directions All rates
I (i) \щ Rate O (i ) |o (i )l Rate
0 { 0 } 0 0.0000 {(1,0), (3,0)} 2 0.1538 0.0769
1 {(0,1)} 1 0.0059 {(2,1), (3,1)} 2 0.1538 0.0799
2 {(1,2)} 1 0.0061 {(3,2), (4,2), (6,2)} 3 0.2308 0.1185
3 {(0,3), (1,3), (2,3)} 3 0.0071 {(4,3), (5,3), (6,3)} 3 0.2308 0.1189
4 {(2,4), (3,4)} 2 0.0091 {(5,4), (6,4)} 2 0.1538 0.0815
5 {(3,5), (4,5)} 2 0.0077 {(6,5)} 1 0.0769 0.0423
6 {(2,6), (3,6), (4,6), (5,6)} 4 0.0069 { 0 } 0 0.0000 0.0035
7 { 0 } 0 0.0000 { 0 } 0 0.0000 0.0000
The results which taken on bases of calculating models by bloc quote of ER rate is given in upper table. The rate is counted in connecting to entering directions' rate of ER. The exiting directions are counted by equal dividing.
We can see the semantic dependences of all ERs in ERC by the following picture. In this the entering and exiting directions of every ER are expressed.
Fig.1.
ER rate is lower when the calculating of rate by ER direction in all ERs. That's why we should count ER rate by divided rate and define their coefficients.
We can write down divided ER rate as TR.
TR = {alrl,a2r2, a3r3,...,anrn} (5)
In here, r is the fields ofER and is ER rate on bases ofthese fields.
The rate of ER fields must be also taken into consideration for calculating ER rate by block quote. It simplifies calculating the rates of ER in ERC.
Calculating ER rates by ER block quote in ERC is important for searching information from ERC and bestowing them.
III. Conclusion
The searching problem of ER in ERC is one of important problems nowadays. In the best searching from GOOGLE system in internet global system as PageRank algorithm is to count the rate of web pages. That's why, we recommend the calculating models (2), (3), (4) of ER rate. This model:
- Accelerates the working of searching information module
- Heightens one step the quality of information on bases of the opportunity of searching information by sorting
- Gives opportunity for marking correspond border to bestow information and reordering it
- Creates elements for analyzing information by block quote and intellectual searching.
We think about the opportunity of using field rate for calculating ER rate gives opportunity for producing promising plans for ER in ERC on bases of finding ER rate among fields.
References:
1. Кулинкович Т. О. Основы научного цитирования: метод. пособие для студентов и магистрантов, - Минск: БГУ, 2010. - 58 с.
2. Brin S., Page L.: The anatomy of a large-scale hyper textual Web search engine. In: Proceedings of the 7th International World Wide Web Conference, Computer Networks 30 (1-7): 107-117, 1998.
3. Page L., Brin S., Motwani R., Winograd T.: The PageRank Citation Ranking: Bringing Order to the Web. Stanford Digital Libraries Working Paper, Stanford University, 1998. - 17 p. http://ilpubs.stanford.edu:8090/422/1/1999-66.pdf