Lösungsvorschlag Information Retrieval FS18: Unterschied zwischen den Versionen

Aus VISki
Wechseln zu: Navigation, Suche
K (Pre-processing and term vocabulary)
(VIII Evaluation)
(8 dazwischenliegende Versionen desselben Benutzers werden nicht angezeigt)
Zeile 2: Zeile 2:
  
  
== Boolean Retrieval ==
+
== I Boolean Retrieval ==
 
=== Standard Inverted Index ===
 
=== Standard Inverted Index ===
 
<ol start="1">
 
<ol start="1">
<li>C)</li>
+
<li>C</li>
<li>A)</li>
+
<li>A</li>
<li>B)</li>
+
<li>B</li>
<li>B)</li>
+
<li>B</li>
<li>B)</li>
+
<li>B</li>
<li>C)</li>
+
<li>C</li>
<li>B)</li>
+
<li>B</li>
<li>C)</li>
+
<li>C</li>
<li>C)</li>
+
<li>C</li>
<li>A)</li>
+
<li>A</li>
<li>D)</li>
+
<li>D</li>
 
</ol>
 
</ol>
  
Zeile 30: Zeile 30:
 
</ol>
 
</ol>
  
== III Pre-processing and term vocabulary ==
+
== II Pre-processing and term vocabulary ==
 
<ol start="20">
 
<ol start="20">
<li>A)</li>
+
<li>A</li>
<li>C)</li>
+
<li>C</li>
<li>D)</li>
+
<li>D</li>
<li>D)</li>
+
<li>D</li>
<li>C)</li>
+
<li>C</li>
<li>C)</li>
+
<li>C</li>
<li>A)</li>
+
<li>A</li>
<li>C)</li>
+
<li>C</li>
 
</ol>
 
</ol>
  
== IV Tolerant Retrieval ==
+
== III Tolerant Retrieval ==
  
 
=== Jaccard Coefficient ===
 
=== Jaccard Coefficient ===
Zeile 106: Zeile 106:
 
         <li>C</li>
 
         <li>C</li>
 
<li>A</li>
 
<li>A</li>
<li>D?</li>
+
<li>D</li>
<li>A,C</li>
+
<li>A, C</li>
 
<li>C</li>
 
<li>C</li>
 
         <li>A</li>
 
         <li>A</li>
Zeile 123: Zeile 123:
 
<li>B</li>
 
<li>B</li>
 
<li>C</li>
 
<li>C</li>
<li>90/110?? A?</li>
+
<li>A (81.8 %)</li>
 
         <li>A</li>
 
         <li>A</li>
 
<li>C</li>
 
<li>C</li>
Zeile 129: Zeile 129:
 
         <li>C</li>
 
         <li>C</li>
 
<li>A</li>
 
<li>A</li>
 
 
</ol>
 
</ol>
VIII Evaluation
 

Version vom 14. August 2019, 14:58 Uhr

If you disagree with the solution. Please state so by editing it. For the answers I filled out I'm not 100% certain and might have made additional mistakes by copying it.


I Boolean Retrieval

Standard Inverted Index

  1. C
  2. A
  3. B
  4. B
  5. B
  6. C
  7. B
  8. C
  9. C
  10. A
  11. D

Boolean Queries

  1. 1 2 3 4 5 6
  2. None
  3. 4 6
  4. 1 2 3 4 5 6
  5. 4
  6. 1 2 3 5
  7. 1 2 3 5
  8. 1 5 6

II Pre-processing and term vocabulary

  1. A
  2. C
  3. D
  4. D
  5. C
  6. C
  7. A
  8. C

III Tolerant Retrieval

Jaccard Coefficient

  1. 0
  2. 2/3
  3. 3/14
  4. 1
  5. 1/2
  6. 1
  7. 0
  8. 0

Levenshtein Distance

  1. 1
  2. 1
  3. 7
  4. 0
  5. 1
  6. 10
  7. 5
  8. 10

IV Index Compression

Standard Inverted Index

  1. C
  2. A
  3. D
  4. D
  5. B

V Ranked Retrieval

  1. 311
  2. 0
  3. 705
  4. 320
  5. 200
  6. 30
  7. 6 (2100), 5 (390), 1 (320)
  8. 15*40 + 3*30 = 690
  9. 0*30 + 11*30 = 330
  10. 11*40 + 70*30 = 2540
  11. 6 (2540)

VI Scoring

  1. A
  2. B

VII Probabilistic Retrieval

  1. B
  2. C
  3. A
  4. D
  5. A, C
  6. C
  7. A
  8. B
  9. C

VIII Evaluation

  1. A
  2. A
  3. C
  4. B
  5. A
  6. B
  7. C
  8. A (81.8 %)
  9. A
  10. C
  11. C
  12. C
  13. A