This page shows visualizations of some width-4 1-d convolutional filters from Google's lm_1b language model. Each column corresponds to one position in the filter, and shows the characters with the most positive weights. Use the checkbox in the bottom-right to also see the most negative weights (may be slow).
Below that are examples of words for which the filter emits the highest values. A filter's response is its maximum value over all substrings it sees in the word. So if a filter has high weights on 'c' in the first position, then 'a', then 't', it will assign equally high scores to 'cat', 'fatcat', 'concatenate', etc. The portion of the string in blue is the substring the filter is responding to.
'^' and '$' represent beginning and end of word markers, respectively. '_' is a padding character. Literal versions of those characters are escaped with a backslash.
Use the links at the top to see filters of other widths.
Check out my blog post here for a bit more context.
Show most negative weights
Filter 0 (bias = -0.47) #
Z
l
M
r
8
-
c
a
C
t
5
p
6
i
X
k
K
b
F
I
/
T
Q
u
4
q
9
\195
V
E
y
A
G
h
7
o
"
z
\194
v
E
y
R
m
-
p
j
K
r
n
Q
L
I
f
J
,
b
c
9
i
u
/
"
C
(
o
e
A
7
M
\195
h
v
D
s
l
g
x
k
%
W
j
B
-
a
g
K
d
A
G
Q
D
w
0
,
J
N
v
/
c
U
l
X
p
H
u
q
o
I
R
Z
h
r
T
P
e
Z
p
N
z
Q
t
M
l
8
v
R
m
9
i
"
a
r
k
3
x
7
T
X
s
H
f
$
d
4
B
/
A
-
U
c
,
P
Non-zero for 19.0% of words.
^FRAN CISCO$____ (3.0) ^NEW$ ____ (2.7) ^Ear lier$____(2.7) ^Ear th$____(2.7) ^Ear ly$____(2.7) ^Ear l$____(2.7) ^MIAM I$____ (2.7) ^MRI$ ____ (2.6) ^EU$ ____(2.3) ^EBI TDA$____(2.2) ^IBM $____(2.2) ^Rau l$____(2.1) ^CEO$ ____ (2.1) ^CIA$ ____ (2.1) ^DENV ER$____ (2.1) ^Ran gers$____(2.1) ^Ran dy$____(2.1) ^Ran dolph$____(2.1) ^Ran gel$____(2.1) ^RBI $____(2.0) ^RBI s$____(2.0) ^cran e$____ (2.0) ^TEHRAN $____ (2.0) ^EAD S$____(2.0) ^RAF $____(1.9) ^TIME$_ ___ (1.9) ^Eag les$____(1.9) ^Eag le$____(1.9) ^rar e$____(1.9) ^rar ely$____(1.9) ^HMRC$ ____ (1.9) ^10-K$ ____ (1.9) ^5-2$ ____ (1.8) ^RBS $____(1.8) ^FIA$ ____ (1.8) ^Ray $____(1.8) ^Ray mond$____(1.8) ^Ray s$____(1.8) ^Equ ity$____(1.8) ^Jar ed$____(1.7) Filter 1 (bias = -0.50) #
<BOS>
h
w
T
K
t
Z
D
E
d
X
H
W
C
Q
u
2
y
V
l
9
g
<EOS>
c
J
A
b
p
M
r
6
k
3
i
5
1
"
Y
f
j
F
z
x
g
h
G
f
p
S
c
\$
w
B
-
4
T
N
I
6
D
\194
O
8
r
:
P
"
\195
9
d
(
!
H
U
7
C
W
m
,
0
Q
m
R
x
C
a
r
h
j
L
I
B
'
i
"
y
$
l
7
o
F
f
-
p
S
w
K
W
v
A
q
b
H
l
k
L
U
7
T
X
v
5
B
6
u
n
s
.
f
j
c
/
t
D
y
Z
m
4
E
\194
z
8
R
1
b
d
i
Q
r
2
w
H
p
Non-zero for 14.1% of words.
^overwhel ming$____ (2.4) ^overwhel mingly$____ (2.4) ^overwhel med$____ (2.4) ^awful $____ (2.3) ^unlawful $____ (2.3) ^Excl uding$____ (2.2) ^when $____ (1.9) ^when ever$____ (1.9) ^whol e$____ (1.8) ^whol esale$____ (1.8) ^whol ly$____ (1.8) ^excl usive$____ (1.7) ^excl uding$____ (1.7) ^excl usively$____ (1.7) ^excl uded$____ (1.7) ^whil e$____ (1.7) ^Meanwhil e$____ (1.7) ^meanwhil e$____ (1.7) ^whil st$____ (1.7) ^FRA NCISCO$____(1.7) ^whal es$____ (1.6) ^whal e$____ (1.6) ^whal ing$____ (1.6) ^Ful ham$____(1.6) ^Ful l$____(1.6) ^Ful ler$____(1.6) ^Fel ipe$____(1.6) ^Fel ix$____(1.6) ^Fel low$____(1.6) ^FOX $____(1.5) ^FOX News.com$____(1.5) ^worl d$____ (1.5) ^worl dwide$____ (1.5) ^worl ds$____ (1.5) ^worl d-class$____ (1.5) ^Karl $____ (1.5) ^Expl orer$____ (1.5) ^When $____ (1.4) ^NFC$ ____ (1.4) ^careful ly$____ (1.4) Filter 2 (bias = -0.71) #
Z
l
8
-
9
d
Q
t
3
m
N
v
K
j
C
p
4
f
W
.
7
e
2
g
1
T
X
u
"
b
5
z
V
s
R
x
6
i
B
o
f
g
;
G
'
0
-
1
r
L
t
h
k
D
"
5
Q
A
M
c
R
d
N
)
:
4
m
2
O
7
/
6
o
8
W
C
K
Z
(
x
g
f
c
M
A
j
C
-
a
J
G
e
Q
o
z
F
p
u
d
S
1
s
.
t
X
B
Z
E
q
i
7
m
\163
N
Y
K
D
v
0
w
Q
h
P
j
R
g
k
e
r
D
b
4
'
M
z
E
p
5
"
H
K
F
U
L
\195
t
I
u
a
0
X
.
B
i
$
1
Y
S
V
c
Non-zero for 18.9% of words.
^fak e$____(2.8) ^far $____(2.8) ^far mers$____(2.8) ^far m$____(2.8) ^far ms$____(2.8) ^fab ric$____(2.6) ^fab ulous$____(2.6) ^McQ ueen$____(2.3) ^warfar e$____ (2.1) ^rar e$____(2.0) ^rar ely$____(2.0) ^tak e$____(2.0) ^tak en$____(2.0) ^tak ing$____(2.0) ^tak es$____(2.0) ^tar get$____(1.9) ^tar gets$____(1.9) ^tar geted$____(1.9) ^tar geting$____(1.9) ^Mak e$____(1.9) ^Mak ing$____(1.9) ^backgr ound$____ (1.9) ^backgr ounds$____ (1.9) ^Mar ch$____(1.9) ^Mar k$____(1.9) ^Mar tin$____(1.9) ^Mar yland$____(1.9) ^intak e$____ (1.8) ^Braz il$____ (1.8) ^Braz ilian$____ (1.8) ^parliamentar y$____ (1.8) ^documentar y$____ (1.8) ^voluntar y$____ (1.8) ^commentar y$____ (1.8) ^non-pr ofit$____ (1.8) ^rap idly$____(1.8) ^rap e$____(1.8) ^rap id$____(1.8) ^rap ed$____(1.8) ^Ankar a$____ (1.8) Filter 3 (bias = -0.71) #
x
D
V
T
b
y
<BOS>
t
s
c
9
d
S
r
B
u
5
H
<EOS>
M
J
O
4
\163
f
g
6
1
3
.
W
C
7
e
w
h
8
m
a
q
F
g
f
z
M
G
N
w
e
!
j
c
B
a
;
p
S
C
\$
A
r
d
J
i
"
s
K
U
Q
1
P
?
X
.
9
n
8
-
t
&
z
f
A
F
m
x
.
S
Z
h
w
j
g
R
X
r
G
e
/
N
l
9
U
o
L
E
a
u
C
3
c
-
D
J
T
8
I
"
V
4
6
m
7
r
X
M
\194
k
Q
T
8
g
d
U
5
u
x
t
2
o
9
w
W
y
4
c
1
E
q
f
I
s
a
O
$
G
"
R
3
_
Non-zero for 29.7% of words.
^exemp t$____ (3.0) ^exemp tion$____ (3.0) ^Hoffma n$____ (2.7) ^Vega s$____ (2.6) ^basema n$____ (2.5) ^defensema n$____ (2.5) ^bega n$____ (2.4) ^Luxemb ourg$____ (2.4) ^seld om$____ (2.4) ^FA$ ____(2.4) ^NASDAQ $____ (2.4) ^bend $____ (2.3) ^mixed$ ____ (2.3) ^fixed$ ____ (2.3) ^relaxed$ ____ (2.3) ^sewa ge$____ (2.3) ^rebel$ ____ (2.3) ^label$ ____ (2.3) ^Nobel$ ____ (2.3) ^libel$ ____ (2.3) ^beca use$____ (2.2) ^beca me$____ (2.2) ^Quebec$ ____ (2.2) ^Halifax $____ (2.2) ^embedd ed$____ (2.1) ^fema le$____ (2.1) ^fema les$____ (2.1) ^Gonzalez$ ____ (2.1) ^UEFA$ ____ (2.1) ^Martinez$ ____ (2.0) ^send $____ (2.0) ^send ing$____ (2.0) ^send s$____ (2.0) ^Caribbean $____ (2.0) ^bean s$____ (2.0) ^fad ed$____(2.0) ^fad ing$____(2.0) ^fad e$____(2.0) ^halfwa y$____ (2.0) ^Karza i$____ (2.0) Filter 4 (bias = -0.48) #
A
d
B
p
w
-
<BOS>
l
N
D
Q
P
U
j
Z
v
W
h
K
g
/
x
k
J
<EOS>
T
C
y
s
0
,
m
I
e
V
G
S
o
R
1
z
f
&
F
Q
M
I
h
O
y
!
m
G
n
Y
x
:
k
X
j
\194
e
l
H
(
)
a
B
"
i
R
u
\195
\$
E
4
W
,
s
H
-
I
c
A
o
q
s
X
m
W
G
2
u
i
v
a
M
7
f
1
'
4
.
,
z
Q
g
6
y
B
R
5
U
3
x
t
D
V
r
X
C
v
A
W
s
q
y
-
U
J
t
w
S
6
c
b
h
9
,
e
r
2
F
E
k
Q
m
"
l
7
R
\194
L
8
O
0
g
Z
o
Non-zero for 14.4% of words.
^BEIJ ING$____ (2.9) ^waiv er$____ (2.8) ^Giv en$____(2.4) ^Giv e$____(2.4) ^Giv ing$____(2.4) ^Gav in$____(2.4) ^Aziz $____ (2.4) ^Qae da$____(2.3) ^Alab ama$____ (2.3) ^IAE A$____(2.3) ^Cliv e$____ (2.3) ^Xav ier$____(2.2) ^Inv estors$____(2.2) ^Inv estment$____(2.2) ^Inv estigators$____(2.2) ^Inv estigation$____(2.2) ^slav ery$____ (2.2) ^slav e$____ (2.2) ^slav es$____ (2.2) ^liv e$____(2.1) ^liv es$____(2.1) ^liv ing$____(2.1) ^liv ed$____(2.1) ^liv er$____(2.1) ^lav ish$____(2.1) ^III$ ____ (2.1) ^Ponzi$ ____ (2.1) ^influenza$ ____ (2.0) ^Aviv $____ (2.0) ^naiv e$____ (2.0) ^Brav es$____ (2.0) ^Brav o$____ (2.0) ^Riv er$____(2.0) ^Riv era$____(2.0) ^Riv ers$____(2.0) ^Riv erside$____(2.0) ^puzzle $____ (2.0) ^Elizab eth$____ (2.0) ^sizab le$____ (2.0) ^Rav ens$____(2.0) Filter 5 (bias = -0.53) #
.
J
c
i
d
B
y
j
Q
E
'
3
/
4
C
k
"
0
D
5
U
e
u
V
\194
K
m
S
-
9
s
6
^
p
Z
2
a
f
P
o
A
Y
z
"
d
R
U
h
D
x
g
9
m
W
!
S
.
N
L
\194
w
;
I
3
c
8
a
:
C
O
t
(
Z
'
P
M
)
Q
n
Q
m
E
-
8
o
2
f
7
n
9
l
4
p
I
v
X
'
3
y
F
t
6
.
"
i
5
M
Z
g
N
u
U
w
1
c
A
x
B
h
Q
t
X
f
G
F
Y
h
"
n
O
u
r
s
R
m
K
d
V
v
$
x
\195
.
b
y
9
L
W
D
P
A
z
c
J
i
l
j
Non-zero for 14.1% of words.
^SEO UL$____(2.8) ^char ges$____ (2.7) ^char ged$____ (2.7) ^char ge$____ (2.7) ^Richar d$____ (2.7) ^corr uption$____ (2.5) ^corr ect$____ (2.5) ^corr espondent$____ (2.5) ^corr upt$____ (2.5) ^researcher s$____ (2.4) ^teacher s$____ (2.4) ^teacher $____ (2.4) ^researcher $____ (2.4) ^Researcher s$____ (2.4) ^N.Y.$ ____ (2.2) ^D-N.Y.$ ____ (2.2) ^soar ing$____ (2.2) ^soar ed$____ (2.2) ^soar $____ (2.2) ^Yar d$____(2.2) ^NEW $____(2.1) ^Char les$____ (2.1) ^Char lie$____ (2.1) ^Char lotte$____ (2.1) ^Char gers$____ (2.1) ^Jacob$ ____ (2.1) ^1.25$ ____ (2.1) ^98$ ____(2.1) ^Broncos$ ____ (2.1) ^CO2$ ____ (2.1) ^92$ ____(2.0) ^uproar $____ (2.0) ^97$ ____(2.0) ^VEG AS$____(2.0) ^embryos$ ____ (2.0) ^JER USALEM$____(2.0) ^ambassador$ ____ (2.0) ^Ambassador$ ____ (2.0) ^Ecuador$ ____ (2.0) ^Salvador$ ____ (2.0) Filter 6 (bias = -0.43) #
K
t
Z
g
9
l
8
-
B
r
3
p
6
d
2
j
5
.
4
T
N
'
M
h
J
k
U
m
W
I
L
A
X
S
<EOS>
i
w
q
1
\163
y
x
r
5
T
J
m
4
k
w
U
0
P
6
;
E
'
2
?
3
t
9
Q
S
C
j
u
7
"
o
!
e
D
l
H
L
c
B
d
8
H
f
Z
S
I
p
X
F
L
k
q
s
1
x
u
y
\195
t
7
c
Q
m
A
e
Y
M
D
j
a
O
/
'
.
i
R
K
9
o
J
E
W
A
"
h
Q
g
-
D
\194
H
$
C
K
n
'
l
9
c
X
r
E
L
O
y
w
p
M
d
S
t
s
T
f
F
1
_
a
Non-zero for 16.5% of words.
^AZUZ$ ____ (3.2) ^Kraf t$____ (3.0) ^KABUL$ ____ (3.0) ^Brus sels$____ (2.8) ^ATLANTA$ ____ (2.8) ^3rd$ ____ (2.7) ^23rd$ ____ (2.7) ^Braw n$____ (2.7) ^Mr.$ ____ (2.6) ^Jr.$ ____ (2.6) ^WTA$ ____ (2.6) ^UPI$ ____ (2.5) ^Brav es$____ (2.5) ^Brav o$____ (2.5) ^Kyle $____ (2.4) ^ETA$ ____ (2.3) ^UAW $____(2.2) ^Kenya$ ____ (2.2) ^Chechnya$ ____ (2.2) ^NHL$ ____ (2.2) ^Braz il$____ (2.2) ^Braz ilian$____ (2.2) ^FRANCIS CO$____ (2.1) ^THE $____(2.0) ^Peru$ ____ (2.0) ^EPA$ ____ (2.0) ^Kyi$ ____ (2.0) ^Zelaya$ ____ (2.0) ^Lt.$ ____ (2.0) ^MRI$ ____ (1.9) ^3.1$ ____ (1.9) ^LCD$ ____ (1.9) ^frus tration$____ (1.9) ^frus trated$____ (1.9) ^frus trating$____ (1.9) ^Libya$ ____ (1.9) ^3.7$ ____ (1.9) ^DETRO IT$____ (1.9) ^TD$ ____(1.9) ^camera$ ____ (1.9) Filter 7 (bias = -0.46) #
-
x
R
B
u
c
<BOS>
A
r
o
Q
p
j
K
P
a
"
w
E
v
I
h
'
n
H
W
d
l
F
z
M
5
U
i
J
m
s
0
^
,
V
-
k
u
C
.
)
y
B
o
0
E
X
r
Y
?
v
s
b
d
9
N
7
w
6
t
G
O
\194
(
p
e
5
%
x
a
8
U
c
\$
w
p
W
r
Z
T
X
h
Q
g
V
o
.
D
B
c
K
j
6
C
2
d
b
R
/
t
a
0
U
l
4
y
$
O
m
P
G
k
f
z
W
g
M
l
F
R
y
T
6
r
4
u
X
A
,
b
5
G
n
\195
8
Y
K
U
3
k
2
-
Z
v
N
E
/
J
e
D
x
s
Non-zero for 20.2% of words.
^low-key $____ (2.6) ^VW$ ____(2.6) ^Arkan sas$____ (2.6) ^Turkey $____ (2.4) ^turkey $____ (2.4) ^Van $____(2.4) ^Van couver$____(2.4) ^Van essa$____(2.4) ^servan ts$____ (2.2) ^servan t$____ (2.2) ^Cuban $____ (2.1) ^Cuban s$____ (2.1) ^urban $____ (2.1) ^suburban $____ (2.1) ^Urban $____ (2.1) ^V.$ ____(2.0) ^occupan ts$____ (2.0) ^occupan cy$____ (2.0) ^survey $____ (1.9) ^survey ed$____ (1.9) ^survey s$____ (1.9) ^Survey $____ (1.9) ^Harvey $____ (1.9) ^Va$ ____(1.9) ^Milwaukee $____ (1.9) ^Ven ezuela$____(1.9) ^Ven ezuelan$____(1.9) ^Ven us$____(1.9) ^Ven ice$____(1.9) ^dubbe d$____ (1.9) ^rubbe r$____ (1.9) ^SOURCE$ ____ (1.8) ^Mercan tile$____ (1.8) ^0.6 $____(1.8) ^Crawf ord$____ (1.8) ^Secretary-Gen eral$____ (1.8) ^Duke$ ____ (1.8) ^Luke$ ____ (1.8) ^works$ ____ (1.8) ^networks$ ____ (1.8) Filter 8 (bias = -0.38) #
<BOS>
f
X
y
.
U
g
F
Q
P
q
K
-
u
V
s
Y
c
b
D
W
,
\194
C
w
o
l
M
^
B
I
R
k
L
N
r
W
f
T
x
H
r
i
b
U
p
1
F
Y
l
M
n
E
.
4
P
2
'
y
q
O
j
G
-
u
k
)
R
3
N
w
\195
&
\$
D
a
L
-
U
f
A
v
z
c
l
j
Y
M
Q
e
a
k
/
n
X
x
G
'
K
t
Z
p
7
g
6
r
I
h
o
F
q
w
x
k
8
g
"
w
S
T
y
i
\194
-
6
r
5
I
9
t
Q
H
,
j
L
J
F
m
7
b
f
\195
c
A
o
u
N
q
K
V
/
P
Non-zero for 13.5% of words.
^Wax man$____(2.6) ^WAS HINGTON$____(2.5) ^Tax $____(2.5) ^MLS $____(2.4) ^Max $____(2.2) ^pre-tax $____ (2.1) ^WTA$ ____ (2.0) ^Way ne$____(1.9) ^Way $____(1.9) ^Kyrgyzs tan$____ (1.9) ^engulf ed$____ (1.8) ^tax $____(1.8) ^tax es$____(1.8) ^tax payers$____(1.8) ^tax payer$____(1.8) ^Tay lor$____(1.8) ^Wild $____ (1.7) ^Wild life$____ (1.7) ^Wild cats$____ (1.7) ^UAW $____(1.7) ^YOU$ ____ (1.7) ^wild $____ (1.7) ^wild life$____ (1.7) ^wild ly$____ (1.7) ^wild fires$____ (1.7) ^Hay es$____(1.7) ^Hay den$____(1.7) ^Hay ward$____(1.7) ^Wils on$____ (1.7) ^Oly mpic$____(1.7) ^Oly mpics$____(1.7) ^Wac hovia$____(1.7) ^max imum$____(1.6) ^max imize$____(1.6) ^DAL LAS$____(1.6) ^Wad e$____(1.6) ^1,8 00$____(1.6) ^EU$ ____(1.6) ^highly $____ (1.6) ^roughly $____ (1.6) Filter 9 (bias = -0.54) #
o
F
O
k
K
V
<BOS>
b
N
h
/
g
T
H
D
x
z
4
"
s
W
j
-
A
M
i
\194
a
Y
n
,
Z
Q
C
R
u
G
e
r
7
Q
m
O
g
N
v
(
k
I
b
X
u
K
-
&
x
W
)
/
h
,
n
2
c
"
i
E
p
:
V
S
s
3
'
\$
d
5
j
8
f
V
-
F
y
j
o
Z
u
7
d
X
m
A
f
5
s
B
p
0
O
4
i
9
U
J
t
Q
'
8
a
b
v
6
w
L
W
C
z
H
,
h
p
u
K
H
z
R
c
Y
P
k
G
j
d
S
y
F
O
V
w
4
D
B
m
A
o
t
2
\194
a
M
\163
r
f
N
X
s
0
b
L
Non-zero for 13.6% of words.
^shotgu n$____ (2.5) ^NAS A$____(2.3) ^NAS CAR$____(2.3) ^NAS DAQ$____(2.3) ^OF$ ____(2.3) ^TOKY O$____ (2.2) ^foreh and$____ (2.1) ^OFT $____(2.1) ^Och oa$____(2.1) ^DNA$ ____ (2.1) ^Nku nda$____(2.1) ^botch ed$____ (2.1) ^NBA $____(2.0) ^WASHINGTON$ ____ (1.9) ^BOSTON$ ____ (1.9) ^HOUSTON$ ____ (1.9) ^NHS $____(1.9) ^corru ption$____ (1.9) ^corru pt$____ (1.9) ^NYS E$____(1.9) ^NFC $____(1.9) ^LONDON$ ____ (1.9) ^ON$_ ___ (1.8) ^IV$ ____(1.8) ^QC$ ____(1.8) ^FOXNe ws.com$____ (1.8) ^Obs erver$____(1.8) ^downh ill$____ (1.7) ^torch $____ (1.7) ^orch estra$____ (1.7) ^orch estrated$____ (1.7) ^TEHR AN$____ (1.7) ^NCA A$____(1.7) ^NAT O$____(1.7) ^DETROIT$ ____ (1.6) ^MIAM I$____ (1.6) ^DENVE R$____ (1.6) ^Oth er$____(1.6) ^Oth ers$____(1.6) ^Oth erwise$____(1.6) Filter 10 (bias = -0.44) #
<BOS>
p
Z
t
M
T
V
y
-
d
J
i
Q
a
b
h
R
,
X
c
j
A
9
O
<EOS>
1
'
o
.
D
N
q
/
\163
^
l
"
x
8
I
.
P
W
G
\194
J
q
p
(
K
:
f
t
%
Q
r
"
R
!
o
?
\195
v
C
H
j
X
k
d
n
a
)
&
3
h
s
T
i
\$
z
P
g
9
h
8
m
2
k
6
A
X
.
F
t
J
Y
N
c
K
s
e
z
3
i
7
'
Q
l
5
v
I
o
D
x
Z
u
E
T
f
b
W
.
i
r
4
c
2
l
3
x
H
d
w
b
M
L
V
D
X
A
6
a
1
f
5
z
Z
F
Y
o
$
R
J
N
E
u
7
v
t
Non-zero for 15.2% of words.
^Wei ss$____(2.9) ^Wei r$____(2.9) ^Wei nstein$____(2.9) ^Madi son$____ (2.3) ^Hei ghts$____(2.3) ^Hei neken$____(2.3) ^Hei di$____(2.3) ^Bernstei n$____ (2.2) ^7.2$ ____ (2.2) ^Zuri ch$____ (2.2) ^7.6$ ____ (2.2) ^6.8$ ____ (2.2) ^Mae$ ____ (2.1) ^6.2$ ____ (2.1) ^4.9$ ____ (2.1) ^4.8$ ____ (2.1) ^9.5$ ____ (2.1) ^6.6$ ____ (2.1) ^4.2$ ____ (2.1) ^3.9$ ____ (2.1) ^3.8$ ____ (2.1) ^3.2$ ____ (2.1) ^4.6$ ____ (2.1) ^unvei led$____ (2.1) ^unvei l$____ (2.1) ^unvei ling$____ (2.1) ^hei ght$____(2.1) ^hei ghtened$____(2.1) ^hei ghts$____(2.1) ^hei r$____(2.1) ^3.6$ ____ (2.0) ^5.9$ ____ (2.0) ^5.8$ ____ (2.0) ^5.2$ ____ (2.0) ^Mari ne$____ (2.0) ^Mari a$____ (2.0) ^Mari nes$____ (2.0) ^Mari o$____ (2.0) ^5.6$ ____ (2.0) ^outfi t$____ (2.0) Filter 11 (bias = -0.56) #
d
M
I
h
-
c
2
Y
P
B
<BOS>
o
a
y
E
k
6
m
s
K
z
N
l
r
p
b
i
T
7
S
1
G
e
g
J
x
j
A
Q
Z
A
f
U
o
Z
-
E
v
X
x
L
n
Q
t
H
'
!
c
G
j
2
p
a
M
V
h
b
\194
Y
u
z
d
)
,
4
S
I
N
&
k
Q
p
N
l
W
h
X
d
M
m
w
L
"
y
9
A
$
g
Z
s
\194
a
K
i
2
U
3
x
P
F
D
u
C
k
x
T
b
D
.
t
n
y
l
u
L
P
5
O
a
U
V
M
A
d
7
r
B
k
6
c
q
R
X
E
w
\163
9
i
Z
p
4
H
N
C
Non-zero for 16.1% of words.
^dawn $____ (3.3) ^IAEA $____ (3.2) ^long-awa ited$____ (2.7) ^MIAMI $____ (2.7) ^Awa rds$____(2.6) ^Awa rd$____(2.6) ^Awa kening$____(2.6) ^DENV ER$____ (2.6) ^CIA$_ ___ (2.6) ^FIA$_ ___ (2.6) ^PHILADELPHIA$_ ___ (2.6) ^Lex us$____(2.4) ^ISLAMA BAD$____ (2.4) ^lawn $____ (2.3) ^spawn ed$____ (2.2) ^lawl ess$____ (2.2) ^cardboa rd$____ (2.2) ^Delawa re$____ (2.1) ^diox ide$____ (2.1) ^Secretary-Gen eral$____ (2.0) ^NASDAQ$ ____ (2.0) ^Xbox $____ (2.0) ^sidewa lk$____ (2.0) ^Dawn $____ (2.0) ^Anb ar$____(2.0) ^box $____(1.9) ^box es$____(1.9) ^box ing$____(1.9) ^box er$____(1.9) ^GMA C$____(1.9) ^AOL $____(1.9) ^FEMA $____ (1.9) ^AIG$_ ___ (1.9) ^mid-199 0s$____ (1.9) ^anx iety$____(1.9) ^anx ious$____(1.9) ^EPA$_ ___ (1.9) ^PA$_ ___ (1.9) ^UAW$ ____ (1.9) ^DAX$ ____ (1.8) Filter 12 (bias = -0.45) #
w
R
.
P
Z
l
W
J
m
j
g
p
A
h
X
f
c
o
M
d
<BOS>
7
a
9
Q
D
E
S
z
F
y
0
/
i
2
r
e
x
q
,
V
t
G
-
K
d
k
e
Y
u
Z
j
c
l
9
.
C
F
z
H
B
D
b
f
)
\$
X
E
Q
L
8
h
W
%
"
I
R
s
0
y
5
k
8
T
6
h
2
A
3
u
4
a
Z
b
7
t
9
m
X
r
Q
l
M
B
F
U
K
q
j
v
S
H
e
Y
$
z
G
\195
/
i
I
j
a
-
W
M
A
f
t
g
,
F
U
x
T
b
i
m
O
.
z
J
2
e
1
Z
Q
'
H
c
B
r
q
h
p
n
/
V
v
Non-zero for 20.8% of words.
^awkwa rd$____ (3.1) ^Hawks$ ____ (2.1) ^Seahawks$ ____ (2.1) ^downwa rd$____ (2.0) ^GMA C$____(1.9) ^Kea ne$____(1.7) ^ticket $____ (1.7) ^ticket s$____ (1.7) ^rocket $____ (1.7) ^cricket $____ (1.7) ^jacket $____ (1.7) ^rocket s$____ (1.7) ^pocket $____ (1.7) ^pocket s$____ (1.7) ^Yea r$____(1.7) ^Yea h$____(1.7) ^Yea rs$____(1.7) ^1.25$ ____ (1.7) ^Vet erans$____(1.7) ^impea chment$____ (1.7) ^Blackwa ter$____ (1.7) ^backwa rd$____ (1.7) ^backwa rds$____ (1.7) ^two-t hirds$____ (1.7) ^two-t ime$____ (1.7) ^wast e$____ (1.7) ^wast ed$____ (1.7) ^wast ing$____ (1.7) ^nickna me$____ (1.7) ^nickna med$____ (1.7) ^breakfa st$____ (1.7) ^want $____ (1.7) ^want ed$____ (1.7) ^want s$____ (1.7) ^want ing$____ (1.7) ^McNa mee$____ (1.7) ^1.9$_ ___ (1.6) ^2.9$_ ___ (1.6) ^0.9$_ ___ (1.6) ^3.9$_ ___ (1.6) Filter 13 (bias = -0.57) #
O
B
d
f
T
n
G
F
D
b
"
x
z
N
W
k
p
h
Q
J
t
L
\194
A
^
9
g
r
c
R
-
a
\163
Z
y
\195
X
u
Y
V
M
x
t
p
U
d
A
v
Z
6
?
0
C
J
m
o
r
-
k
9
H
7
/
l
T
h
.
5
&
8
Q
a
(
i
R
1
'
2
u
3
c
-
y
j
K
l
G
J
O
u
C
I
U
b
M
H
W
e
D
i
z
h
"
E
/
q
Z
g
T
F
m
R
o
s
Q
r
,
7
8
x
X
u
Q
m
\194
r
7
U
6
k
5
f
4
o
W
b
S
-
$
y
2
\195
8
c
"
z
I
w
R
v
n
s
p
B
Non-zero for 15.7% of words.
^McQ ueen$____(3.5) ^YORK$ ____ (2.9) ^dry$ ____ (2.9) ^laundry$ ____ (2.9) ^Try$ ____ (2.8) ^MyS pace$____(2.8) ^My$ ____(2.8) ^empty$ ____ (2.8) ^pretty$ ____ (2.7) ^Betty$ ____ (2.7) ^petty$ ____ (2.7) ^UK$ ____(2.5) ^Guy$ ____ (2.5) ^Orch estra$____ (2.5) ^McD onald$____(2.3) ^McD onnell$____(2.3) ^country$ ____ (2.3) ^industry$ ____ (2.3) ^try$ ____ (2.3) ^Ministry$ ____ (2.3) ^entry$ ____ (2.3) ^ministry$ ____ (2.3) ^Country$ ____ (2.3) ^Industry$ ____ (2.3) ^GAO$ ____ (2.3) ^my$ ____(2.3) ^angry$ ____ (2.3) ^hungry$ ____ (2.3) ^cry$ ____ (2.2) ^outcry$ ____ (2.2) ^Kentucky$ ____ (2.2) ^lucky$ ____ (2.2) ^Ricky$ ____ (2.2) ^tricky$ ____ (2.2) ^Once $____ (2.2) ^McC ain$____(2.2) ^McC hrystal$____(2.2) ^McC arthy$____(2.2) ^McC onnell$____(2.2) ^OTC$ ____ (2.1) Filter 14 (bias = -0.49) #
X
t
m
v
y
c
Y
z
K
C
Q
s
"
I
b
D
W
u
L
d
M
T
<BOS>
0
f
g
V
k
/
w
Z
n
H
U
^
j
8
A
P
E
N
j
W
p
w
g
K
C
O
k
(
d
o
P
&
F
E
)
Q
l
?
V
"
h
:
v
/
i
a
D
U
n
.
m
B
c
2
0
3
T
X
t
p
F
G
n
Y
u
x
s
b
A
6
j
8
I
v
H
0
w
K
N
"
r
z
f
W
.
V
C
9
M
P
e
\194
k
7
S
J
E
V
y
Q
H
b
t
\194
u
7
o
S
h
z
w
9
q
G
T
R
i
X
p
Y
r
5
-
C
a
j
W
J
O
l
e
s
d
Z
1
6
N
Non-zero for 15.5% of words.
^NYS E$____(3.0) ^NYC $____(2.8) ^NY$ ____(2.5) ^maps $____ (2.5) ^mob$ ____ (2.5) ^robb ery$____ (2.4) ^robb ed$____ (2.4) ^robb ers$____ (2.4) ^robb eries$____ (2.4) ^presumabl y$____ (2.3) ^payabl e$____ (2.3) ^box$ ____ (2.3) ^Xbox$ ____ (2.3) ^crops $____ (2.2) ^drops $____ (2.2) ^Shrops hire$____ (2.2) ^Obs erver$____(2.2) ^map$ ____ (2.2) ^obj ect$____(2.2) ^obj ects$____(2.2) ^obj ective$____(2.2) ^obj ections$____(2.2) ^obj ectives$____(2.2) ^obj ected$____(2.2) ^obl igations$____(2.2) ^obl igation$____(2.2) ^obl iged$____(2.2) ^obs ervers$____(2.1) ^obs erved$____(2.1) ^obs tacles$____(2.1) ^obs ervation$____(2.1) ^obs tacle$____(2.1) ^obs cure$____(2.1) ^obs ession$____(2.1) ^probabl y$____ (2.1) ^probabl e$____ (2.1) ^probl ems$____ (2.1) ^probl em$____ (2.1) ^probl ematic$____ (2.1) ^lobb y$____ (2.1) Filter 15 (bias = -0.45) #
w
F
<BOS>
f
W
P
X
D
Q
x
.
c
-
s
q
C
Y
d
I
j
^
U
H
S
p
y
k
h
L
v
0
B
h
p
Y
d
B
-
N
P
H
z
(
f
W
s
A
c
4
I
L
m
X
'
\194
C
:
g
9
r
8
D
q
t
5
G
6
i
7
\163
Q
k
Y
k
G
F
K
f
y
v
O
j
L
t
M
x
/
I
Z
q
m
n
X
e
W
r
U
-
"
b
$
p
o
d
3
a
s
.
B
H
f
h
K
1
z
4
o
u
B
y
p
g
l
d
O
D
r
7
P
Z
J
6
R
8
s
W
\195
Y
b
X
w
q
k
2
x
"
N
M
S
Non-zero for 7.8% of words.
^BAGH DAD$____ (2.2) ^hyg iene$____(2.1) ^why$ ____ (2.1) ^hyd rogen$____(2.1) ^Why$ ____ (2.0) ^WASH INGTON$____ (2.0) ^24-hou r$____ (1.8) ^two-hou r$____ (1.8) ^half-hou r$____ (1.8) ^in-hou se$____ (1.8) ^Loh an$____(1.8) ^Hyu ndai$____(1.7) ^NY$ ____(1.7) ^Hyd e$____(1.7) ^hou se$____(1.7) ^hou rs$____(1.7) ^hou sing$____(1.7) ^hou r$____(1.7) ^who$ ____ (1.7) ^BEIJING$ ____ (1.6) ^You $____(1.6) ^You ng$____(1.6) ^You r$____(1.6) ^You Tube$____(1.6) ^WHO$ ____ (1.6) ^Who$ ____ (1.6) ^throughou t$____ (1.5) ^Throughou t$____ (1.5) ^Bou levard$____(1.5) ^Bou rnemouth$____(1.5) ^Boy le$____(1.5) ^Boy $____(1.5) ^Boy s$____(1.5) ^Boy d$____(1.5) ^NYC $____(1.5) ^Amy $____(1.5) ^Nou ri$____(1.5) ^Bod y$____(1.5) ^WAM$ ____ (1.4) ^BMW $____(1.4) Filter 16 (bias = -0.51) #
V
w
7
f
j
o
Y
a
C
t
Z
p
4
-
F
y
8
W
G
v
R
O
6
m
X
K
H
q
5
z
L
.
g
x
0
i
Q
,
9
N
w
L
W
D
I
h
q
y
-
%
.
G
v
Y
!
l
Q
P
?
U
t
R
a
u
2
J
:
C
X
o
(
m
e
F
'
r
E
T
\194
H
L
k
K
-
X
i
Z
h
/
R
D
'
m
p
A
v
Q
u
E
C
U
n
M
x
.
g
6
r
e
s
z
o
G
t
N
j
l
c
B
d
X
L
7
m
v
u
6
o
9
l
Q
A
V
U
\194
.
C
r
2
s
8
y
W
f
0
h
4
-
5
O
I
_
1
\195
p
E
"
a
q
t
Non-zero for 14.1% of words.
^Gwen $____ (3.1) ^10-K$ ____ (3.0) ^CIA$ ____ (3.0) ^7-6$ ____ (3.0) ^wav e$____(2.9) ^wav es$____(2.9) ^wav ing$____(2.9) ^wav ed$____(2.9) ^7.6$ ____ (2.8) ^FIA$ ____ (2.8) ^we$ ____(2.8) ^7-5$ ____ (2.8) ^jam$ ____ (2.7) ^Swed en$____ (2.7) ^Swed ish$____ (2.7) ^7.2$ ____ (2.7) ^Rwan da$____ (2.6) ^7.5$ ____ (2.6) ^PHILADELP HIA$____ (2.6) ^wen t$____(2.5) ^wed ding$____(2.5) ^wed dings$____(2.5) ^Camp bell$____ (2.5) ^Camp $____ (2.5) ^Camp aign$____ (2.5) ^wak e$____(2.4) ^Swee t$____ (2.3) ^Swee ney$____ (2.3) ^4-6$ ____ (2.3) ^Swan sea$____ (2.3) ^Swan n$____ (2.3) ^Imp erial$____(2.2) ^wei ght$____(2.2) ^wei ghed$____(2.2) ^wei gh$____(2.2) ^wei ghing$____(2.2) ^BERLIN$ ____ (2.2) ^NL$ ____(2.2) ^7.4$ ____ (2.2) ^Zimbabwe$ ____ (2.2) Filter 17 (bias = -0.36) #
Q
b
8
m
7
B
d
.
3
k
6
A
2
w
1
v
"
q
5
T
4
r
O
g
s
h
,
t
9
l
S
M
C
n
^
\195
I
f
\194
u
&
j
W
u
K
-
Q
F
X
f
z
e
\194
k
/
r
Y
H
!
P
8
t
5
g
(
m
"
i
G
h
o
d
O
E
9
M
N
J
c
)
C
v
U
x
G
q
Z
o
V
f
g
-
A
W
R
e
H
w
Y
a
P
N
k
.
r
B
s
d
Q
b
/
E
m
h
M
9
S
\194
$
\163
w
T
n
c
-
G
W
z
2
k
6
U
X
Y
Z
R
N
r
e
C
q
S
4
g
3
D
5
A
I
s
H
h
M
l
f
p
J
O
/
b
Non-zero for 12.6% of words.
^QC$ ____(2.4) ^WAM $____(2.1) ^Win dows$____(2.0) ^Win ter$____(2.0) ^Win frey$____(2.0) ^Win gs$____(2.0) ^Kin g$____(1.9) ^Kin gdom$____(1.9) ^Kin gs$____(1.9) ^Kin dle$____(1.9) ^Kin gston$____(1.9) ^XVI $____(1.9) ^\194\174$ ____(1.9) ^IOC$ ____ (1.9) ^Wi- Fi$____(1.8) ^Corn wall$____ (1.8) ^Corn ell$____ (1.8) ^Wagn er$____ (1.8) ^Ki- moon$____(1.8) ^Kre mlin$____(1.8) ^Xin hua$____(1.7) ^Xin jiang$____(1.7) ^doin g$____ (1.7) ^wrongdoin g$____ (1.7) ^Kuw ait$____(1.7) ^S.C. $____ (1.6) ^ACORN $____ (1.6) ^Carn egie$____ (1.6) ^Carn ival$____ (1.6) ^HSBC$ ____ (1.6) ^AC$ ____(1.6) ^message $____ (1.6) ^message s$____ (1.6) ^passage $____ (1.6) ^usage $____ (1.6) ^ICC$ ____ (1.6) ^Warw ickshire$____ (1.5) ^dose $____ (1.5) ^dose s$____ (1.5) ^overdose $____ (1.5) Filter 18 (bias = -0.64) #
Q
f
U
x
X
p
W
j
Y
g
"
F
H
c
Z
v
<BOS>
h
^
n
/
-
2
m
I
l
1
e
3
t
.
k
b
s
S
v
f
W
l
c
F
?
P
w
j
q
%
T
m
"
L
0
p
9
S
2
r
a
s
1
;
z
,
)
e
E
J
N
'
(
i
Y
/
8
d
z
h
O
F
C
b
p
e
d
H
G
M
c
u
Q
m
o
j
\194
f
D
k
0
B
1
E
/
w
,
g
K
r
$
.
I
A
8
q
P
V
m
v
X
u
Z
U
.
R
M
s
y
z
L
k
H
E
/
B
g
T
Q
9
V
a
l
0
A
d
n
I
$
x
r
o
Y
J
t
i
Non-zero for 15.9% of words.
^vom iting$____(2.6) ^Wom en$____(2.6) ^Wom an$____(2.6) ^com pany$____(2.4) ^com e$____(2.4) ^com panies$____(2.4) ^com es$____(2.4) ^wom en$____(2.3) ^wom an$____(2.3) ^voy age$____(2.2) ^Viacom $____ (1.8) ^Tom $____(1.8) ^Tom my$____(1.8) ^Tom linson$____(1.8) ^Tom as$____(1.8) ^adm inistration$____(1.8) ^adm itted$____(1.8) ^adm it$____(1.8) ^adm inistrative$____(1.8) ^cog nitive$____(1.8) ^vol ume$____(1.8) ^vol unteers$____(1.8) ^vol untary$____(1.8) ^vol atile$____(1.8) ^Wol f$____(1.7) ^Wol ves$____(1.7) ^von $____(1.7) ^newcom ers$____ (1.7) ^newcom er$____ (1.7) ^Edm onton$____(1.7) ^Won der$____(1.7) ^cam e$____(1.7) ^cam paign$____(1.7) ^cam p$____(1.7) ^cam era$____(1.7) ^200m $____ (1.6) ^Wim bledon$____(1.6) ^Wor ld$____(1.6) ^Wor kers$____(1.6) ^Wor k$____(1.6) Filter 19 (bias = -0.39) #
9
m
7
f
Y
k
"
A
\194
U
8
t
-
p
R
a
<BOS>
B
Q
b
3
F
6
w
o
z
1
s
^
y
X
P
0
i
4
l
N
K
J
L
D
k
d
b
"
A
8
f
O
i
0
m
&
B
c
V
\194
n
o
F
G
h
Q
H
6
r
(
l
2
a
X
s
1
g
9
t
E
x
7
p
U
f
z
-
C
o
G
j
Q
x
A
M
a
e
s
n
I
h
c
l
R
J
Z
N
E
m
k
v
P
q
D
i
T
w
"
t
1
F
\163
.
Q
j
"
D
W
0
U
d
Y
l
$
v
r
c
H
n
b
p
y
g
X
5
Z
J
k
x
'
o
t
C
e
F
1
-
Non-zero for 14.5% of words.
^YOU$ ____ (3.5) ^NASDAQ $____ (3.0) ^DC$ ____(2.9) ^DUP $____(2.8) ^EU$ ____(2.6) ^DAX $____(2.5) ^MDC$ ____ (2.5) ^Da$ ____(2.5) ^Dar ling$____(2.4) ^Dar fur$____(2.4) ^Dar ren$____(2.4) ^Dar win$____(2.4) ^Dar k$____(2.4) ^Day $____(2.3) ^Day s$____(2.3) ^Dak ota$____(2.3) ^USDA$ ____ (2.2) ^SOUR CE$____ (2.1) ^CDC$ ____ (2.1) ^Dam e$____(2.1) ^Dam ascus$____(2.1) ^Dam on$____(2.1) ^Dam ien$____(2.1) ^one-day $____ (2.1) ^two-day $____ (2.1) ^three-day $____ (2.1) ^day-to-day $____ (2.1) ^soda$ ____ (2.0) ^Woods$ ____ (2.0) ^goods$ ____ (2.0) ^methods$ ____ (2.0) ^periods$ ____ (2.0) ^foods$ ____ (2.0) ^neighborhoods$ ____ (2.0) ^1990s$ ____ (2.0) ^90s$ ____ (2.0) ^mid-1990s$ ____ (2.0) ^1970s$ ____ (2.0) ^70s$ ____ (2.0) ^da$ ____(2.0) Filter 20 (bias = -0.52) #
M
g
F
p
<BOS>
d
f
z
V
c
B
G
m
r
/
-
X
a
Z
o
\194
T
6
\163
S
E
K
0
4
D
H
O
Q
v
,
1
L
\195
5
I
D
p
H
x
L
b
(
f
u
k
/
w
U
v
&
-
I
g
A
o
T
m
F
'
1
a
M
G
t
c
7
r
Q
z
C
i
6
W
Z
V
Y
F
m
f
i
e
X
E
g
N
G
B
w
c
-
s
W
D
V
x
l
j
z
v
p
u
Z
S
$
t
/
8
H
9
h
r
R
F
-
0
i
5
p
V
r
\194
a
c
O
9
y
B
u
M
U
6
I
8
w
v
E
Z
s
C
d
X
P
7
H
j
\195
D
t
x
o
4
m
Non-zero for 14.3% of words.
^FDIC $____ (2.4) ^Dic k$____(2.2) ^Div ision$____(2.1) ^D-C alif$____(2.0) ^Hic ks$____(1.9) ^Dix on$____(1.9) ^DVD $____(1.8) ^DVD s$____(1.8) ^MIAM I$____ (1.7) ^FDA$ ____ (1.7) ^Doc tors$____(1.7) ^Doc tor$____(1.7) ^func tion$____ (1.6) ^func tions$____ (1.6) ^func tioning$____ (1.6) ^func tional$____ (1.6) ^MDC$ ____ (1.6) ^Dov er$____(1.6) ^Mumb ai$____ (1.6) ^DAX $____(1.6) ^D-N .Y.$____(1.5) ^Dav id$____(1.5) ^Dav is$____(1.5) ^Dav e$____(1.5) ^Dav ies$____(1.5) ^Liv erpool$____(1.5) ^Liv e$____(1.5) ^Liv ing$____(1.5) ^Liv ni$____(1.5) ^BAGHDAD $____ (1.5) ^HIV $____(1.5) ^Di$ ____(1.5) ^conflic t$____ (1.4) ^conflic ts$____ (1.4) ^conflic ting$____ (1.4) ^inflic ted$____ (1.4) ^D.C .$____(1.4) ^Hin du$____(1.4) ^Hoc key$____(1.4) ^liftin g$____ (1.4) Filter 21 (bias = -0.47) #
8
w
6
z
7
g
L
t
9
s
F
O
X
-
H
E
h
G
x
i
q
k
N
S
Z
o
4
p
B
T
1
c
n
m
Q
U
\194
I
D
'
h
c
Y
z
H
d
:
v
(
p
W
P
S
D
i
)
A
w
u
C
4
e
o
I
"
\163
L
0
N
s
3
G
l
f
y
-
;
F
/
E
N
k
5
m
8
-
X
p
K
u
9
g
Q
i
L
b
6
f
o
s
7
P
\194
'
/
U
0
r
D
v
3
d
O
y
W
t
2
V
4
w
I
m
S
-
Q
u
E
f
O
v
t
n
A
x
W
b
X
U
2
'
7
y
5
M
N
Z
4
c
T
s
H
d
,
J
e
o
.
k
Non-zero for 14.7% of words.
^NYSE $____ (3.3) ^hot el$____(3.1) ^hot $____(3.1) ^hot els$____(3.1) ^hot test$____(3.1) ^Mourinho$ ____ (2.8) ^NHL$ ____ (2.8) ^Hot el$____(2.8) ^Hot $____(2.8) ^Hot els$____(2.8) ^ATLANT A$____ (2.7) ^PHILADE LPHIA$____ (2.6) ^Phoe nix$____ (2.5) ^Foot ball$____ (2.4) ^NHS$ ____ (2.4) ^Idaho$ ____ (2.2) ^HBO S$____(2.2) ^HBO $____(2.2) ^Manhat tan$____ (2.2) ^bondhol ders$____ (2.2) ^shoot ing$____ (2.2) ^shoot $____ (2.2) ^shoot ings$____ (2.2) ^shoot out$____ (2.2) ^quot ed$____ (2.2) ^quot e$____ (2.2) ^quot es$____ (2.2) ^quot ing$____ (2.2) ^hor se$____(2.1) ^hor ses$____(2.1) ^hor ror$____(2.1) ^hor mone$____(2.1) ^hor rible$____(2.1) ^hor izon$____(2.1) ^1945$ ____ (2.1) ^Who$ ____ (2.1) ^Ho$ ____(2.1) ^hol d$____(2.0) ^hol ding$____(2.0) ^hol iday$____(2.0) Filter 22 (bias = -0.47) #
L
t
K
k
y
j
U
-
G
v
/
i
Z
g
D
I
8
w
O
q
Q
n
X
h
P
e
o
u
N
H
m
'
z
V
"
s
<EOS>
4
Y
<BOS>
c
P
h
f
0
U
Y
m
o
!
v
I
g
%
S
-
x
;
G
F
T
r
\194
:
5
a
9
d
4
\195
W
L
8
s
C
Z
1
u
7
H
Q
p
/
g
\194
i
$
x
"
P
U
h
.
v
s
0
'
J
N
b
R
e
G
d
c
1
q
r
y
k
f
X
h
B
y
K
-
W
d
Q
g
Z
t
z
u
b
r
V
f
9
j
6
p
J
s
2
n
Y
c
w
o
\194
F
8
i
U
C
"
'
5
H
Non-zero for 15.2% of words.
^Los$ ____ (3.2) ^LSU$ ____ (3.1) ^Lou$ ____ (3.0) ^MLS$_ ___ (2.8) ^LG$_ ___ (2.7) ^embryos$ ____ (2.7) ^Tokyo$_ ___ (2.6) ^Mayo$_ ___ (2.6) ^you$ ____ (2.4) ^Lisb on$____ (2.4) ^TOKYO$ ____ (2.4) ^Oh$_ ___ (2.3) ^Go$_ ___ (2.3) ^MSNB C$____ (2.2) ^LLC$_ ___ (2.2) ^PLC$_ ___ (2.2) ^1980s$ ____ (2.2) ^80s$ ____ (2.2) ^80$_ ___ (2.1) ^1980$_ ___ (2.1) ^180$_ ___ (2.1) ^Toyota $____ (2.1) ^0.9 $____(2.1) ^Pc$_ ___ (2.0) ^bloc$_ ___ (2.0) ^havoc$_ ___ (2.0) ^oh$_ ___ (2.0) ^cub ic$____(2.0) ^US$_ ___ (2.0) ^hub $____(2.0) ^Koso vo$____ (2.0) ^Do$_ ___ (2.0) ^Lt.$ ____ (2.0) ^0.6 $____(2.0) ^Low$ ____ (1.9) ^Loui s$____ (1.9) ^Loui siana$____ (1.9) ^Loui sville$____ (1.9) ^Loui se$____ (1.9) ^No.$ ____ (1.9) Filter 23 (bias = -0.36) #
.
k
-
B
<BOS>
p
Q
P
\194
f
/
K
^
i
"
U
X
x
v
J
T
C
0
b
z
c
F
R
\195
Z
k
Q
j
!
h
&
i
/
p
X
t
?
f
K
F
"
r
W
u
(
e
\194
l
M
g
.
P
8
E
G
x
:
v
d
T
J
G
f
g
u
j
i
L
y
z
W
Z
h
D
x
A
-
0
,
X
o
E
q
V
a
l
k
c
H
J
p
5
n
.
v
Q
'
F
t
b
d
.
p
Q
P
b
i
w
T
E
d
Z
D
s
y
W
t
x
H
"
1
z
j
A
C
N
r
a
n
V
-
S
k
X
o
L
f
B
h
\194
u
Non-zero for 5.6% of words.
^.... $____ (4.0) ^.... .$____ (4.0) ^U.K.$ ____ (3.2) ^G.M.$ ____ (2.8) ^...$ ____ (2.7) ^non-GAA P$____ (2.6) ^U.N.$ ____ (2.4) ^N.Y.$ ____ (2.4) ^D-N.Y.$ ____ (2.4) ^middle-cla ss$____ (2.3) ^working-cla ss$____ (2.3) ^world-cla ss$____ (2.3) ^first-cla ss$____ (2.3) ^N.F.L.$ ____ (2.2) ^McQ ueen$____(2.1) ^p.m.$ ____ (2.1) ^a.m.$ ____ (2.1) ^KAB UL$____(2.0) ^MLS $____(2.0) ^1.25$ ____ (2.0) ^D.C.$ ____ (2.0) ^N.C.$ ____ (2.0) ^S.C.$ ____ (2.0) ^LG$ ____(2.0) ^Zea land$____(2.0) ^Q.$ ____(1.9) ^guardian.co. uk$____ (1.9) ^Web $____(1.9) ^Web b$____(1.9) ^Web ber$____(1.9) ^Web ster$____(1.9) ^Web er$____(1.9) ^Ph.D.$ ____ (1.8) ^WAS HINGTON$____(1.8) ^N.J. $____ (1.8) ^north-wes t$____ (1.8) ^MGM $____(1.8) ^L.A. $____ (1.8) ^Mr. $____(1.8) ^80s $____(1.7) Filter 24 (bias = -0.43) #
<BOS>
1
f
H
'
g
F
y
l
G
x
i
.
h
\194
U
j
2
S
u
v
3
t
A
b
a
m
T
-
4
Q
E
B
Y
M
D
e
Z
N
0
G
u
X
t
K
m
2
h
8
-
5
k
Z
f
Q
l
0
s
9
.
3
i
7
H
6
v
O
'
1
U
4
a
c
b
W
A
E
T
D
%
m
I
Y
E
v
2
T
r
k
d
h
N
\194
e
b
n
B
3
V
s
M
w
W
-
"
5
'
1
x
P
$
O
X
F
u
a
t
7
l
j
d
f
D
w
\194
B
7
k
Y
E
C
r
6
b
1
K
$
e
l
N
X
x
/
F
Q
a
L
o
0
t
8
S
A
p
s
_
Non-zero for 20.2% of words.
^feud $____ (2.8) ^watchd og$____ (2.7) ^2m$ ____(2.6) ^MGM$ ____ (2.5) ^fold $____ (2.4) ^3m$ ____(2.4) ^problem$ ____ (2.3) ^Jerusalem$ ____ (2.3) ^Harlem$ ____ (2.3) ^EBITD A$____ (2.2) ^50m$ ____ (2.2) ^GB$ ____(2.2) ^GM$ ____(2.2) ^Kid s$____(2.1) ^Kid d$____(2.1) ^1m$ ____(2.1) ^system$ ____ (2.1) ^stem$ ____ (2.1) ^System$ ____ (2.1) ^item$ ____ (2.1) ^post-mortem$ ____ (2.1) ^food $____ (2.0) ^food s$____ (2.0) ^seafood $____ (2.0) ^NOT$ ____ (2.0) ^God $____(2.0) ^slalom$ ____ (1.9) ^fema le$____ (1.9) ^fema les$____ (1.9) ^fell $____ (1.9) ^fell ow$____ (1.9) ^Rockefell er$____ (1.9) ^match$ ____ (1.9) ^watch$ ____ (1.9) ^pitch$ ____ (1.9) ^catch$ ____ (1.9) ^OTC $____(1.9) ^NYC $____(1.8) ^check$ ____ (1.8) ^neck$ ____ (1.8) Filter 25 (bias = -0.41) #
Z
p
9
t
F
i
8
w
V
o
7
O
X
-
L
y
Q
a
b
s
J
f
6
m
R
W
<BOS>
,
M
z
j
d
B
I
0
g
4
'
Y
T
(
v
Q
x
&
p
O
f
/
k
:
b
?
)
N
j
"
V
!
i
r
m
H
J
A
B
I
g
.
F
U
0
R
n
Y
e
y
c
w
H
b
1
x
i
v
4
.
2
-
3
f
D
'
5
r
,
z
I
k
M
a
A
q
y
c
C
p
t
o
6
R
/
\195
Z
B
T
"
O
\163
-
F
Y
t
b
f
g
y
l
,
G
D
.
c
V
B
x
C
z
K
\194
N
'
n
X
P
J
T
Q
U
$
M
"
e
m
I
\195
k
2
Non-zero for 15.6% of words.
^brib es$____ (2.6) ^brib ery$____ (2.6) ^brig ht$____ (2.4) ^brig ade$____ (2.4) ^bril liant$____ (2.3) ^bril liantly$____ (2.3) ^Brig hton$____ (2.2) ^Brig ade$____ (2.2) ^Nig eria$____(2.0) ^Nig ht$____(2.0) ^Nig erian$____(2.0) ^Nig el$____(2.0) ^Nig er$____(2.0) ^Oil $____(2.0) ^rib s$____(1.9) ^buil ding$____ (1.9) ^buil d$____ (1.9) ^buil t$____ (1.9) ^buil dings$____ (1.9) ^rig ht$____(1.8) ^rig hts$____(1.8) ^rig ht-wing$____(1.8) ^rig orous$____(1.8) ^rig htly$____(1.8) ^Hig h$____(1.8) ^Hig hway$____(1.8) ^Hig her$____(1.8) ^Hig hland$____(1.8) ^terrib le$____ (1.8) ^horrib le$____ (1.8) ^terrib ly$____ (1.8) ^Flig ht$____ (1.7) ^19th- century$____ (1.7) ^Wi- Fi$____(1.7) ^Buil ding$____ (1.7) ^Uig hurs$____(1.7) ^Uig hur$____(1.7) ^bail out$____ (1.7) ^bail $____ (1.7) ^bail ed$____ (1.7) Filter 26 (bias = -0.52) #
C
b
I
J
U
x
d
v
c
E
n
e
D
h
Q
W
/
w
,
B
'
q
s
M
t
Y
P
o
R
-
A
4
z
i
F
m
r
j
y
X
J
y
j
h
X
U
0
s
7
t
Z
a
-
u
I
.
V
f
2
x
P
m
5
A
9
S
6
"
n
c
g
W
G
,
1
o
3
k
)
B
W
u
p
D
K
j
i
.
B
g
x
c
X
F
a
d
,
-
O
C
w
r
Y
R
q
L
f
s
k
M
"
Z
6
n
V
U
Q
h
2
e
Q
f
"
J
X
P
Y
o
\194
i
Z
-
$
I
.
t
V
w
W
p
8
E
h
l
c
s
7
r
\195
j
B
e
u
z
Non-zero for 22.9% of words.
^MOSCOW$ ____ (3.0) ^Clic k$____ (2.9) ^CITY $____ (2.9) ^adjac ent$____ (2.6) ^CIT$ ____ (2.6) ^FRANCISC O$____ (2.6) ^VW$ ____(2.5) ^Clay $____ (2.5) ^Clay ton$____ (2.5) ^Cric ket$____ (2.4) ^CIA$ ____ (2.4) ^Clim ate$____ (2.3) ^CEO$ ____ (2.3) ^jih ad$____(2.3) ^Jac kson$____(2.2) ^Jac k$____(2.2) ^Jac ob$____(2.2) ^Jac obs$____(2.2) ^Sunni$ ____ (2.2) ^IPO$ ____ (2.1) ^Jay $____(2.1) ^Jay s$____(2.1) ^Jag uar$____(2.1) ^Punjab $____ (2.0) ^Cliv e$____ (2.0) ^Benjam in$____ (2.0) ^CNBC $____ (2.0) ^Jo$ ____(2.0) ^jac ket$____(2.0) ^jac kets$____(2.0) ^Jim $____(1.9) ^Jim my$____(1.9) ^JP$ ____(1.9) ^enjoy $____ (1.9) ^enjoy ed$____ (1.9) ^enjoy ing$____ (1.9) ^enjoy s$____ (1.9) ^Anna$ ____ (1.9) ^Madonna$ ____ (1.9) ^Vienna$ ____ (1.9) Filter 27 (bias = -0.51) #
W
p
H
P
X
f
w
d
4
j
Z
l
Q
-
A
r
2
z
q
R
^
c
/
s
Y
v
"
o
M
x
J
D
G
0
b
g
v
V
d
G
a
Y
q
i
N
Z
u
S
t
m
D
j
f
4
B
M
.
H
T
X
c
3
x
5
\163
C
U
)
o
%
e
I
\$
k
-
V
u
p
L
C
l
Q
o
b
.
x
D
F
m
9
w
8
d
"
t
B
U
r
s
7
z
P
E
c
/
X
J
R
O
'
M
q
e
6
r
7
t
8
T
x
o
S
k
\194
w
4
A
5
y
Q
n
F
B
V
m
X
i
9
p
s
H
"
q
j
u
d
\195
$
N
G
U
E
a
Non-zero for 20.3% of words.
^Wiki pedia$____ (3.1) ^Mike $____ (2.7) ^Nike $____ (2.6) ^Wire less$____ (2.6) ^hike $____ (2.5) ^hike s$____ (2.5) ^GPS $____(2.4) ^Mikh ail$____ (2.2) ^wipe d$____ (2.2) ^wipe $____ (2.2) ^NYC$ ____ (2.2) ^Wins ton$____ (2.1) ^GPs $____(2.1) ^ships $____ (2.1) ^relationships $____ (2.1) ^championships $____ (2.1) ^chips $____ (2.1) ^Ask$ ____ (2.1) ^G8$ ____(2.0) ^Agre ement$____ (2.0) ^GB$ ____(2.0) ^Like $____ (2.0) ^talks $____ (2.0) ^walks $____ (2.0) ^Talks $____ (2.0) ^Wind ows$____ (2.0) ^Wind $____ (2.0) ^Wind sor$____ (2.0) ^gre at$____(2.0) ^gre ater$____(2.0) ^gre en$____(2.0) ^gre w$____(2.0) ^gre atest$____(2.0) ^gre enhouse$____(2.0) ^risks $____ (2.0) ^asks $____ (2.0) ^tasks $____ (2.0) ^masks $____ (2.0) ^Hawks $____ (1.9) ^Seahawks $____ (1.9) Filter 28 (bias = -0.47) #
<BOS>
y
Q
h
E
d
w
u
W
m
X
p
I
c
N
D
S
L
2
C
K
x
O
g
e
l
"
n
^
U
5
P
1
a
v
H
g
f
Y
x
R
L
T
y
k
K
I
N
j
F
-
B
V
n
;
a
G
.
C
\$
Q
o
r
e
z
,
'
m
&
%
)
w
i
5
\194
8
I
u
Q
M
l
f
A
m
X
v
p
y
7
c
O
U
a
o
2
-
S
x
W
B
t
F
q
n
s
Z
k
R
'
h
R
y
V
h
-
D
s
d
b
p
k
L
z
t
w
c
J
,
E
F
Q
q
'
l
U
H
G
f
Z
1
Y
o
9
x
$
T
\195
A
"
n
Non-zero for 13.3% of words.
^TAR P$____(2.7) ^ERA$ ____ (2.5) ^YOR K$____(2.4) ^ETA$ ____ (2.3) ^WTA$ ____ (2.3) ^NASCAR $____ (2.3) ^III$ ____ (2.3) ^Wils on$____ (2.2) ^TOR ONTO$____(2.2) ^IRA$ ____ (2.2) ^XVI$ ____ (2.2) ^MRI$ ____ (2.1) ^Wilk inson$____ (2.1) ^HIV $____(2.1) ^Vegas $____ (2.0) ^Fabregas $____ (2.0) ^gas $____(2.0) ^gas oline$____(2.0) ^gas es$____(2.0) ^Ras mussen$____(2.0) ^Ras hid$____(2.0) ^II$ ____(1.9) ^NYSE $____ (1.9) ^ATLANTA$ ____ (1.9) ^eggs $____ (1.9) ^WTO$ ____ (1.9) ^IAE A$____(1.9) ^Ips wich$____(1.9) ^Tas k$____(1.8) ^CHICAG O$____ (1.8) ^TIM E$____(1.7) ^Tak e$____(1.7) ^Tak ing$____(1.7) ^PAR IS$____(1.7) ^legis lation$____ (1.7) ^regis tered$____ (1.7) ^legis lative$____ (1.7) ^regis ter$____ (1.7) ^YOU $____(1.7) ^Ris k$____(1.7) Filter 29 (bias = -0.48) #
B
p
.
i
b
O
v
P
<BOS>
d
N
y
q
s
M
E
V
I
\194
G
Z
1
n
U
x
2
9
,
F
-
h
z
c
3
k
S
X
a
Q
o
I
f
i
F
W
x
Y
.
1
L
T
r
\194
e
2
y
C
b
H
m
z
\$
&
%
w
h
k
N
V
j
X
l
4
S
o
p
s
E
n
S
m
O
C
Y
d
h
.
3
D
G
v
o
c
r
t
"
Z
R
/
4
'
W
U
J
z
g
P
9
f
N
\194
x
l
e
F
8
I
Y
f
V
B
\194
y
$
c
g
F
Q
N
S
K
7
r
X
e
G
a
4
P
"
U
n
v
t
p
D
o
_
w
Non-zero for 25.6% of words.
^BCS$ ____ (3.1) ^Fabio$ ____ (2.7) ^biog raphy$____ (2.7) ^autobiog raphy$____ (2.7) ^BAE$ ____ (2.6) ^Silvio$ ____ (2.6) ^NHS$ ____ (2.6) ^PARIS$ ____ (2.5) ^TORONTO$ ____ (2.5) ^Big$ ____ (2.5) ^HBOS$ ____ (2.5) ^RBIs$ ____ (2.5) ^big$ ____ (2.4) ^bigg est$____ (2.4) ^bigg er$____ (2.4) ^Robbie$ ____ (2.4) ^Debbie$ ____ (2.4) ^Lockerbie$ ____ (2.4) ^biol ogical$____ (2.3) ^biol ogy$____ (2.3) ^movie$ ____ (2.3) ^Movie$ ____ (2.3) ^cannabis$ ____ (2.2) ^SOURCE$ ____ (2.2) ^viol ence$____ (2.2) ^viol ent$____ (2.2) ^viol ations$____ (2.2) ^viol ated$____ (2.2) ^Mir$ ____ (2.2) ^Virg inia$____ (2.1) ^Virg in$____ (2.1) ^Antonio$ ____ (2.1) ^Davis$ ____ (2.1) ^Elvis$ ____ (2.1) ^Travis$ ____ (2.1) ^IS$ ____(2.1) ^DENVER $____ (2.0) ^Ohio$ ____ (2.0) ^1.25$ ____ (1.9) ^Bashir$ ____ (1.9) Filter 30 (bias = -0.59) #
<BOS>
t
X
y
9
p
Z
T
7
m
Q
c
6
g
8
o
J
z
N
h
2
k
4
i
3
s
5
A
V
O
F
U
I
a
j
G
n
l
^
S
F
k
e
-
\$
v
N
Y
E
z
2
u
f
m
5
R
4
'
8
T
L
g
3
C
S
c
W
i
6
)
K
G
Q
V
X
U
,
\195
(
l
Y
k
S
B
j
f
7
U
G
w
h
a
5
b
g
r
4
m
l
v
6
t
\194
K
3
n
0
q
1
P
8
N
$
z
O
\195
L
c
X
.
Y
f
\194
r
4
B
7
t
V
K
6
A
g
N
-
F
1
a
$
p
Z
P
X
k
"
o
H
,
G
U
W
l
8
e
3
b
0
_
O
Non-zero for 25.9% of words.
^1957$ ____ (3.1) ^1947$ ____ (3.0) ^1987$ ____ (3.0) ^1955$ ____ (3.0) ^1954$ ____ (2.9) ^1945$ ____ (2.9) ^1985$ ____ (2.9) ^1944$ ____ (2.9) ^787$ ____ (2.9) ^NFL$ ____ (2.9) ^1984$ ____ (2.9) ^1956$ ____ (2.8) ^1967$ ____ (2.8) ^1934$ ____ (2.8) ^1946$ ____ (2.8) ^1986$ ____ (2.8) ^1953$ ____ (2.7) ^1950$ ____ (2.7) ^1965$ ____ (2.7) ^1951$ ____ (2.7) ^1964$ ____ (2.7) ^1943$ ____ (2.7) ^1983$ ____ (2.7) ^1940$ ____ (2.7) ^1958$ ____ (2.6) ^1980$ ____ (2.6) ^NY$ ____(2.6) ^1941$ ____ (2.6) ^1981$ ____ (2.6) ^750$ ____ (2.6) ^1966$ ____ (2.6) ^1933$ ____ (2.6) ^Schwarzenegg er$____ (2.6) ^egg s$____(2.6) ^egg $____(2.6) ^1977$ ____ (2.6) ^1948$ ____ (2.6) ^1988$ ____ (2.6) ^BEIJING$ ____ (2.5) ^1963$ ____ (2.5) Filter 31 (bias = -0.47) #
<BOS>
f
w
h
W
F
Q
P
O
y
z
m
E
u
I
L
X
x
"
p
^
r
\194
H
2
n
G
d
l
k
b
D
j
B
k
D
V
o
C
d
;
.
'
E
Q
-
P
e
R
L
b
l
B
v
Z
0
"
t
F
g
Y
w
r
u
f
T
U
j
)
c
p
h
,
z
X
f
L
p
H
o
Q
S
7
c
q
t
6
y
Z
O
9
s
u
m
8
i
a
k
b
'
\195
g
J
w
2
M
D
C
1
K
Y
x
d
,
y
B
d
k
H
v
1
b
u
t
s
q
4
T
-
l
Z
f
G
r
3
N
"
o
8
J
2
z
U
K
6
x
$
w
L
A
7
\195
D
R
Non-zero for 15.5% of words.
^okay $____ (2.6) ^key $____(2.4) ^key s$____(2.4) ^key board$____(2.4) ^key note$____(2.4) ^blockad e$____ (2.3) ^Pay ne$____(2.2) ^Pay $____(2.2) ^Brookly n$____ (2.2) ^Cad bury$____(2.2) ^Cad illac$____(2.2) ^Ray $____(2.1) ^Ray mond$____(2.1) ^Ray s$____(2.1) ^quickly $____ (2.1) ^buy $____(2.1) ^buy ing$____(2.1) ^buy ers$____(2.1) ^buy er$____(2.1) ^Vau xhall$____(2.0) ^Vau ghan$____(2.0) ^Buy $____(2.0) ^Rud d$____(2.0) ^Rud y$____(2.0) ^Pad res$____(2.0) ^bay $____(2.0) ^Bay $____(1.9) ^Bay ern$____(1.9) ^Bay lor$____(1.9) ^Rad io$____(1.9) ^Rad cliffe$____(1.9) ^low-key $____ (1.9) ^bud get$____(1.9) ^bud gets$____(1.9) ^weekly $____ (1.8) ^Weekly $____ (1.8) ^Blackbu rn$____ (1.8) ^blockbu ster$____ (1.8) ^Cus toms$____(1.8) ^Cus tomers$____(1.8) Filter 32 (bias = -0.77) #
<BOS>
t
L
T
/
g
X
k
Z
h
Q
i
<EOS>
y
K
p
l
j
9
e
\194
E
z
c
6
u
.
r
8
H
7
\163
B
d
b
-
5
v
N
F
W
s
X
f
V
l
2
t
4
C
Y
.
E
o
6
n
q
A
"
c
H
L
3
F
b
,
)
D
Z
U
w
%
9
d
1
'
J
r
7
z
f
z
F
.
B
-
i
U
h
G
k
g
t
u
,
d
S
s
M
E
p
L
j
Z
H
a
4
\195
x
b
n
w
5
Q
W
R
6
r
T
A
W
j
w
F
z
P
K
H
o
f
"
k
O
r
Q
h
a
i
.
C
N
n
X
g
\194
u
v
d
B
l
c
p
$
-
Y
J
/
e
q
s
Non-zero for 23.8% of words.
^Wiz ards$____(4.1) ^Who $____(3.8) ^Wha t$____(3.5) ^Wha tever$____(3.5) ^WHO $____(3.1) ^Califo rnia$____ (3.1) ^Vio lence$____(3.0) ^biz arre$____(2.9) ^WTO $____(2.9) ^Halifa x$____ (2.8) ^Wyo ming$____(2.8) ^Crawfo rd$____ (2.8) ^Befo re$____ (2.8) ^befo re$____ (2.8) ^Silvio $____ (2.7) ^ANGELES$ ____ (2.7) ^Via com$____(2.7) ^JERUSALEM$ ____ (2.7) ^Wea ther$____(2.7) ^awkw ard$____ (2.6) ^likeliho od$____ (2.6) ^Betw een$____ (2.6) ^Woo ds$____(2.6) ^Woo d$____(2.6) ^Woo dward$____(2.6) ^Woo dy$____(2.6) ^bio logical$____(2.6) ^bio fuels$____(2.6) ^bio graphy$____(2.6) ^bio logy$____(2.6) ^betw een$____ (2.6) ^Netw ork$____ (2.6) ^Netw orks$____ (2.6) ^BMW $____(2.5) ^unifo rm$____ (2.5) ^unifo rms$____ (2.5) ^Leic ester$____ (2.5) ^Leic estershire$____ (2.5) ^HBO S$____(2.5) ^HBO $____(2.5) Filter 33 (bias = -0.54) #
N
g
W
-
B
j
K
m
Q
p
,
G
/
d
O
v
q
s
"
l
X
'
a
V
H
b
A
k
2
c
3
z
I
C
<BOS>
x
8
i
9
J
W
g
\194
U
"
P
:
C
o
r
x
k
X
)
(
A
Q
F
v
G
Y
R
S
n
N
Z
6
u
O
c
&
p
/
s
q
\195
D
z
Z
f
g
t
H
d
V
p
G
v
Y
P
A
s
w
,
X
F
4
x
1
o
3
S
7
'
2
O
L
l
n
T
5
-
M
e
h
k
0
I
f
g
F
u
x
T
B
G
N
Y
9
H
e
m
K
h
Q
y
6
D
8
U
X
A
W
c
5
i
,
z
2
C
q
-
7
1
b
d
S
l
Non-zero for 11.4% of words.
^Norf olk$____ (3.1) ^None $____ (3.0) ^None theless$____ (3.0) ^Nobe l$____ (2.8) ^Nige ria$____ (2.8) ^Nige rian$____ (2.8) ^Nige l$____ (2.8) ^Nige r$____ (2.8) ^Now$ ____ (2.6) ^NW$_ ___ (2.3) ^Wolf $____ (2.3) ^N.Y. $____ (2.1) ^D-N.Y. $____ (2.1) ^Winf rey$____ (2.1) ^No.$ ____ (2.0) ^toge ther$____ (2.0) ^altoge ther$____ (2.0) ^Hoga n$____ (2.0) ^NBA$ ____ (2.0) ^Kobe $____ (2.0) ^Wome n$____ (1.9) ^Newa rk$____ (1.9) ^Alge ria$____ (1.9) ^Wagn er$____ (1.8) ^Toge ther$____ (1.8) ^Howe ver$____ (1.8) ^WAS HINGTON$____(1.8) ^Whe n$____(1.8) ^Whe re$____(1.8) ^Whe ther$____(1.8) ^Whe eler$____(1.8) ^Nine $____ (1.8) ^NYC$ ____ (1.7) ^Nobo dy$____ (1.7) ^Wiga n$____ (1.7) ^New$ ____ (1.7) ^hydroge n$____ (1.7) ^ATLANTA$ ____ (1.7) ^Norw ay$____ (1.7) ^Norw egian$____ (1.7) Filter 34 (bias = -0.51) #
P
-
C
.
p
h
k
u
U
v
r
l
K
j
R
M
c
H
z
m
G
\194
,
e
F
q
Q
W
I
Y
O
<BOS>
s
L
B
w
f
J
a
o
Q
-
F
o
S
1
s
i
V
u
b
J
k
H
"
D
'
T
\$
d
E
n
U
h
A
p
f
y
.
0
&
q
B
\195
!
v
:
l
x
g
X
d
Z
u
w
s
K
U
n
v
V
h
M
x
N
y
J
a
/
T
Q
D
r
F
5
E
j
t
I
c
\195
\163
f
k
"
z
Y
c
h
p
X
z
Q
t
H
U
7
P
4
T
V
k
\194
D
"
C
6
d
W
f
8
K
$
v
b
s
.
O
9
w
x
y
Z
I
3
_
Non-zero for 20.1% of words.
^Copenh agen$____ (2.4) ^Perh aps$____ (2.3) ^Twickenh am$____ (2.3) ^PKK$ ____ (2.2) ^Pew$ ____ (2.2) ^Pan$ ____ (2.2) ^Can$ ____ (2.1) ^perh aps$____ (2.1) ^USA$ ____ (2.0) ^Isn$ ____ (2.0) ^U.K. $____ (1.9) ^Japan$ ____ (1.9) ^span$ ____ (1.9) ^pan$ ____ (1.9) ^Greenspan$ ____ (1.9) ^CEO$ ____ (1.9) ^CBI$ ____ (1.9) ^CNN$ ____ (1.9) ^collapse$ ____ (1.9) ^glimpse$ ____ (1.9) ^open$ ____ (1.8) ^happen$ ____ (1.8) ^Open$ ____ (1.8) ^reopen$ ____ (1.8) ^Lankan$ ____ (1.8) ^Car$ ____ (1.8) ^JERUSAL EM$____ (1.8) ^wasn$ ____ (1.8) ^hasn$ ____ (1.8) ^U.N. $____ (1.8) ^USC$ ____ (1.8) ^NCAA$ ____ (1.8) ^taken$ ____ (1.8) ^broken$ ____ (1.8) ^spoken$ ____ (1.8) ^chicken$ ____ (1.8) ^draw$ ____ (1.8) ^withdraw$ ____ (1.8) ^raw$ ____ (1.8) ^Straw$ ____ (1.8) Filter 35 (bias = -0.36) #
<BOS>
T
w
h
.
k
Z
p
-
Y
Q
i
n
y
X
D
I
U
N
P
/
o
2
S
W
v
^
l
u
B
G
c
0
x
E
h
X
C
w
n
e
x
W
A
O
c
K
u
Q
k
;
l
2
s
M
F
"
g
I
y
J
L
-
a
\163
R
P
H
T
.
:
d
G
'
V
d
k
y
B
D
Y
-
W
u
Q
L
C
.
w
o
X
f
Z
h
4
r
S
e
$
l
9
m
I
p
z
\163
\194
x
R
P
5
c
i
F
W
j
y
R
m
C
X
l
"
n
K
I
M
J
Q
s
w
F
Z
t
.
0
O
D
b
k
$
u
Y
-
q
r
\163
d
/
7
a
P
S
Non-zero for 9.1% of words.
^Heineke n$____ (2.5) ^Newm an$____ (2.5) ^low-ke y$____ (2.4) ^Egy pt$____(2.2) ^Egy ptian$____(2.2) ^NEW$ ____ (2.2) ^IV$ ____(2.1) ^week$ ____ (2.1) ^seek$ ____ (2.1) ^Greek$ ____ (2.1) ^Week$ ____ (2.1) ^Creek$ ____ (2.1) ^two-week$ ____ (2.1) ^sleek$ ____ (2.1) ^Sky $____(2.1) ^Sky pe$____(2.1) ^ITV$ ____ (2.0) ^awkw ard$____ (2.0) ^Wim bledon$____(1.9) ^web$ ____ (1.9) ^newb orn$____ (1.9) ^weeke nd$____ (1.8) ^weeke nds$____ (1.8) ^seeke rs$____ (1.8) ^TV$ ____(1.8) ^new$ ____ (1.8) ^knew$ ____ (1.8) ^renew$ ____ (1.8) ^Kim $____(1.7) ^runway $____ (1.7) ^Conway $____ (1.7) ^renewa ble$____ (1.7) ^renewa l$____ (1.7) ^African-Am erican$____ (1.7) ^African-Am ericans$____ (1.7) ^DIEGO $____ (1.6) ^renewe d$____ (1.6) ^newe st$____ (1.6) ^newe r$____ (1.6) ^eBa y$____(1.6) Filter 36 (bias = -0.39) #
Q
u
<BOS>
J
O
v
W
U
X
R
p
h
I
L
w
B
t
x
^
s
/
D
"
9
j
F
b
0
-
\195
f
k
Y
a
G
i
R
h
K
w
Q
n
&
x
O
s
X
t
M
F
T
\$
;
f
J
d
r
H
"
e
z
A
o
q
D
4
\195
I
P
2
\194
.
D
b
t
-
T
J
,
f
C
j
c
x
/
s
y
E
O
e
W
g
\194
r
1
V
A
k
o
F
$
u
K
\195
H
P
Q
R
X
w
N
m
c
f
Q
-
0
m
\194
s
X
l
"
w
8
u
C
i
7
o
9
L
Y
a
T
r
G
n
D
U
v
A
V
e
6
t
$
.
1
y
W
\195
Non-zero for 16.8% of words.
^MDC $____(2.1) ^OTC $____(2.0) ^McQ ueen$____(2.0) ^Roc k$____(2.0) ^Roc kies$____(2.0) ^Roc kets$____(2.0) ^Roc kefeller$____(2.0) ^occ urred$____(1.9) ^occ ur$____(1.9) ^occ asion$____(1.9) ^occ upied$____(1.9) ^occ asionally$____(1.9) ^occ asions$____(1.9) ^occ asional$____(1.9) ^spot$ ____ (1.8) ^pot$ ____ (1.8) ^Depot$ ____ (1.8) ^IRA$ ____ (1.8) ^McC ain$____(1.8) ^McC hrystal$____(1.8) ^McC arthy$____(1.8) ^McC onnell$____(1.8) ^McC artney$____(1.8) ^McC ann$____(1.8) ^TD$ ____(1.8) ^proc ess$____ (1.8) ^proc edures$____ (1.8) ^proc eedings$____ (1.8) ^proc edure$____ (1.8) ^YORK$ ____ (1.7) ^MTV $____(1.7) ^IPCC $____ (1.7) ^TOKYO$ ____ (1.7) ^Socc er$____ (1.7) ^NYC$ ____ (1.7) ^WTO$ ____ (1.7) ^FARC$ ____ (1.7) ^Ky$ ____(1.7) ^IOC$ ____ (1.7) ^ACORN$ ____ (1.6) Filter 37 (bias = -0.57) #
Y
z
H
w
h
A
7
a
u
K
4
t
R
p
9
m
j
f
6
.
3
c
-
U
\194
B
8
l
1
s
"
y
J
O
V
L
M
b
^
e
X
k
Q
U
:
y
\194
c
7
C
(
m
6
A
-
f
"
s
J
a
&
u
W
p
Y
n
5
B
j
h
9
t
O
F
8
)
i
g
k
d
V
h
m
.
B
u
b
N
C
L
K
D
z
-
M
H
G
o
Z
a
v
E
'
e
c
q
p
l
w
y
P
1
Y
r
i
I
T
A
A
f
g
x
u
p
U
F
.
P
E
K
z
B
H
k
Y
b
I
n
w
v
T
m
G
9
t
e
D
'
$
J
1
M
a
j
s
,
Q
8
Non-zero for 10.9% of words.
^XVI $____(3.6) ^bulk$ ____ (3.1) ^Abdulmu tallab$____ (2.9) ^Nku nda$____(2.9) ^CHICA GO$____ (2.8) ^HIV$ ____ (2.5) ^milk$ ____ (2.2) ^silk$ ____ (2.2) ^Milk$ ____ (2.2) ^NBA $____(2.2) ^VEGA S$____ (2.2) ^highlig hted$____ (2.1) ^highlig ht$____ (2.1) ^highlig hts$____ (2.1) ^highlig hting$____ (2.1) ^them$ ____ (2.1) ^anthem$ ____ (2.1) ^health-ca re$____ (2.1) ^NCA A$____(2.1) ^thems elves$____ (2.1) ^QC$ ____(2.0) ^whom$ ____ (2.0) ^Thoma s$____ (1.9) ^Oklahoma $____ (1.9) ^Thoms on$____ (1.9) ^WTA $____(1.9) ^oka y$____(1.9) ^Heig hts$____ (1.9) ^heig ht$____ (1.9) ^heig htened$____ (1.9) ^heig hts$____ (1.9) ^10m$ ____ (1.8) ^non-GA AP$____ (1.8) ^Rev. $____ (1.8) ^bulbs $____ (1.8) ^Roma n$____ (1.7) ^Roma nia$____ (1.7) ^Roma $____ (1.7) ^Roma nian$____ (1.7) ^IV$ ____(1.7) Filter 38 (bias = -0.50) #
C
-
k
.
B
h
U
d
V
y
I
o
R
x
K
g
<BOS>
e
z
u
Q
r
,
q
Z
l
P
L
9
p
\194
m
A
f
T
b
5
\163
F
E
K
u
X
h
f
t
!
H
b
T
Q
g
%
k
N
C
L
)
P
i
J
d
:
v
/
s
;
c
p
1
O
D
l
U
Z
A
w
4
m
j
v
p
.
i
\194
f
b
y
X
o
Q
r
q
S
"
O
z
,
W
C
Z
n
u
P
a
h
6
j
d
F
T
g
$
3
E
K
B
s
9
G
I
c
l
v
P
M
Q
y
7
w
X
m
r
n
O
u
A
o
p
x
S
.
Y
f
R
h
H
Z
\195
B
J
'
j
g
L
k
E
-
W
Non-zero for 18.6% of words.
^Cabl e$____ (2.1) ^Coul d$____ (2.1) ^Coal ition$____ (2.1) ^Coal $____ (2.1) ^Djokovi c$____ (2.1) ^Jankovi c$____ (2.1) ^Cove ntry$____ (2.0) ^remarkabl e$____ (2.0) ^remarkabl y$____ (2.0) ^Cava liers$____ (1.9) ^Frankfur t$____ (1.9) ^Boul evard$____ (1.9) ^Co.$ ____ (1.9) ^Blackbur n$____ (1.7) ^Bowl $____ (1.7) ^Clar k$____ (1.7) ^Clar ke$____ (1.7) ^Clar kson$____ (1.7) ^Clar ence$____ (1.7) ^K.$ ____(1.7) ^Cabr era$____ (1.6) ^Cour t$____ (1.6) ^Cour ic$____ (1.6) ^Beve rly$____ (1.6) ^Kel ly$____(1.5) ^Kel ler$____(1.5) ^awkwar d$____ (1.5) ^backwar d$____ (1.5) ^backwar ds$____ (1.5) ^MOSCOW$ ____ (1.5) ^Coll ege$____ (1.5) ^Coll ins$____ (1.5) ^Coll ingwood$____ (1.5) ^Coll ection$____ (1.5) ^thoughtful $____ (1.5) ^doubtful $____ (1.5) ^shortfal l$____ (1.4) ^ful l$____(1.4) ^ful ly$____(1.4) ^ful l-time$____(1.4) Filter 39 (bias = -0.39) #
W
D
w
P
4
T
<BOS>
U
q
r
X
F
2
m
Q
c
3
d
x
f
5
C
6
l
^
p
\194
j
"
t
7
L
z
k
y
R
J
y
V
d
Y
D
j
c
l
?
-
F
S
U
i
.
w
a
R
\163
o
!
b
u
%
T
5
t
9
H
G
\$
z
h
B
C
0
P
3
q
O
v
K
j
W
h
w
x
I
F
U
.
z
d
E
-
Q
u
2
n
X
c
3
g
G
'
,
f
P
C
a
D
\195
l
N
b
A
s
/
m
X
u
V
c
W
.
b
y
B
D
6
o
Q
n
Y
-
9
s
7
t
J
d
4
C
2
r
"
g
x
U
K
A
p
_
q
m
k
'
\194
L
Non-zero for 13.2% of words.
^Job s$____(2.3) ^Job $____(2.3) ^SUV $____(2.1) ^Wiza rds$____ (1.9) ^Jap an$____(1.9) ^Jap anese$____(1.9) ^Jak e$____(1.9) ^Jak arta$____(1.9) ^GOP $____(1.9) ^VW$ ____(1.8) ^Jav ier$____(1.7) ^lab or$____(1.7) ^lab el$____(1.7) ^lab $____(1.7) ^lab our$____(1.7) ^job $____(1.7) ^job s$____(1.7) ^job less$____(1.7) ^XVI$ ____ (1.7) ^Punjab $____ (1.7) ^Wii$ ____ (1.7) ^JP$ ____(1.6) ^Sab athia$____(1.6) ^Sab eri$____(1.6) ^lib eral$____(1.6) ^lib rary$____(1.6) ^lib erals$____(1.6) ^lib erties$____(1.6) ^Work ers$____ (1.6) ^Work $____ (1.6) ^Work ing$____ (1.6) ^Work s$____ (1.6) ^YOR K$____(1.6) ^Wimb ledon$____ (1.6) ^Joi nt$____(1.6) ^lob by$____(1.6) ^lob bying$____(1.6) ^lob byists$____(1.6) ^lob byist$____(1.6) ^TOKYO$ ____ (1.5) Filter 40 (bias = -0.47) #
V
v
Z
.
X
E
C
o
Y
x
H
t
G
-
/
e
6
u
7
f
^
r
P
q
4
a
M
N
1
w
i
c
m
b
Q
\163
3
d
5
h
F
m
S
p
E
g
j
a
f
n
Q
v
e
1
\$
!
r
i
N
z
R
y
s
c
t
w
O
d
"
h
(
q
I
x
M
G
;
Z
,
?
Z
x
M
o
X
p
A
v
j
R
.
u
L
a
m
k
H
d
/
s
F
f
w
h
e
-
V
i
g
9
D
P
Q
"
5
U
t
r
n
'
Q
p
\194
x
$
f
"
b
S
n
Y
q
/
w
s
v
r
a
e
o
h
g
J
B
-
\195
c
k
Non-zero for 14.5% of words.
^FA$ ____(2.6) ^F.$ ____(2.5) ^S.$ ____(2.5) ^JERUSALEM$ ____ (2.2) ^E.$ ____(2.1) ^USA$ ____ (2.0) ^NYSE$ ____ (2.0) ^FAR C$____(2.0) ^St$ ____(2.0) ^S.C .$____(2.0) ^Q.$ ____(1.9) ^CIA$ ____ (1.9) ^Fes tival$____(1.9) ^AZUZ$ ____ (1.9) ^FIFA$ ____ (1.8) ^EMI $____(1.8) ^NASA$ ____ (1.8) ^N.$ ____(1.7) ^life$ ____ (1.7) ^wife$ ____ (1.7) ^Life$ ____ (1.7) ^knife$ ____ (1.7) ^wildlife$ ____ (1.7) ^Wildlife$ ____ (1.7) ^Mr.$ ____ (1.7) ^MRSA$ ____ (1.7) ^R.$ ____(1.7) ^NL$ ____(1.7) ^PHILADELPHIA$ ____ (1.7) ^FSA$ ____ (1.7) ^FC$ ____(1.7) ^Ms.$ ____ (1.7) ^NAS CAR$____(1.7) ^NAS DAQ$____(1.7) ^em$ ____(1.6) ^Yet$ ____ (1.6) ^PM$ ____(1.6) ^firm$ ____ (1.6) ^confirm$ ____ (1.6) ^Pew$ ____ (1.6) Filter 41 (bias = -0.37) #
<BOS>
y
V
o
X
c
b
r
l
O
Y
N
\194
t
m
p
J
,
Z
a
6
f
j
h
Q
E
v
1
-
A
7
\163
^
w
L
U
'
e
K
Z
t
J
S
\195
h
X
s
M
x
V
y
n
d
w
,
)
f
b
\$
K
p
9
a
P
O
-
c
R
F
?
W
G
"
!
\194
r
i
L
'
u
p
/
T
U
c
Q
e
R
g
Z
j
.
t
$
F
-
v
N
0
n
D
Y
E
\194
S
a
G
\163
k
h
f
y
i
\194
-
Y
I
M
i
"
s
X
w
v
a
V
p
Q
n
c
f
T
r
8
d
Z
A
B
E
$
l
b
u
W
P
9
,
0
o
6
t
m
2
Non-zero for 21.5% of words.
^Juv entus$____(3.9) ^Zum a$____(3.5) ^Zac h$____(3.4) ^Jav ier$____(3.3) ^J.$ ____(3.3) ^Jac kson$____(3.2) ^Jac k$____(3.2) ^Jac ob$____(3.2) ^Jac obs$____(3.2) ^N.J.$ ____ (3.0) ^Muc h$____(2.8) ^AZUZ $____ (2.7) ^Jam es$____(2.7) ^Jam ie$____(2.7) ^Jam aica$____(2.7) ^nuc lear$____(2.6) ^nuc lear-armed$____(2.6) ^buc k$____(2.5) ^buc ket$____(2.5) ^album $____ (2.4) ^album s$____ (2.4) ^Mub arak$____(2.4) ^Jak e$____(2.4) ^Jak arta$____(2.4) ^JPM organ$____(2.3) ^Xav ier$____(2.3) ^Mav ericks$____(2.3) ^Jo$ ____(2.2) ^Mum bai$____(2.2) ^M.$ ____(2.2) ^Job s$____(2.2) ^Job $____(2.2) ^Mac $____(2.2) ^Mac y$____(2.2) ^Mac k$____(2.2) ^Zaz i$____(2.2) ^G.M .$____(2.2) ^Jr$ ____(2.1) ^Zur ich$____(2.1) ^bub ble$____(2.1) Filter 42 (bias = -0.43) #
<BOS>
f
X
t
V
y
Y
a
Q
,
Z
U
G
h
-
F
g
s
^
B
J
x
\194
A
7
o
"
u
j
p
M
k
b
i
c
n
d
p
v
!
o
P
u
V
B
m
T
G
q
C
N
F
h
Z
J
g
t
'
0
s
9
Q
W
y
R
X
a
S
\194
f
Y
)
D
j
x
/
E
Q
-
y
j
"
J
K
i
W
l
Z
v
.
u
c
p
U
k
/
n
8
g
X
I
$
t
G
P
N
R
A
\195
L
o
O
f
a
T
0
v
O
x
r
q
S
6
G
n
U
9
g
d
s
\194
A
8
E
X
R
7
t
0
Y
5
k
B
y
.
z
2
T
W
i
a
m
b
o
1
K
Non-zero for 22.0% of words.
^pav ement$____(3.6) ^pav ed$____(3.6) ^pav e$____(3.6) ^Gav in$____(3.4) ^Cav aliers$____(3.3) ^Fav re$____(3.2) ^gav e$____(3.2) ^max imum$____(3.0) ^max imize$____(3.0) ^syn drome$____(3.0) ^syn thetic$____(3.0) ^sav e$____(2.9) ^sav ings$____(2.9) ^sav ed$____(2.9) ^sav ing$____(2.9) ^Syd ney$____(2.6) ^Xav ier$____(2.6) ^Sav e$____(2.6) ^pan el$____(2.6) ^pan ic$____(2.6) ^pan els$____(2.6) ^pan demic$____(2.6) ^pan ts$____(2.6) ^pan $____(2.6) ^fav orite$____(2.6) ^fav or$____(2.6) ^fav our$____(2.6) ^fav ourite$____(2.6) ^pad $____(2.6) ^pad s$____(2.6) ^Pan thers$____(2.5) ^Pan ama$____(2.5) ^Pan el$____(2.5) ^Pan $____(2.5) ^Pad res$____(2.5) ^man-mad e$____ (2.4) ^Van $____(2.4) ^Van couver$____(2.4) ^Van essa$____(2.4) ^man y$____(2.4) Filter 43 (bias = -0.65) #
5
t
Z
-
6
k
3
r
8
'
2
.
K
p
4
m
9
v
X
l
1
u
L
f
7
b
0
g
G
d
J
s
M
T
N
h
W
q
D
a
I
m
p
L
r
%
C
K
Q
f
t
U
R
M
d
w
T
b
k
Z
7
B
1
.
q
J
"
l
O
x
\163
s
g
z
c
u
j
/
(
!
K
-
G
u
Q
j
Y
n
X
w
z
e
P
t
b
h
U
d
V
i
p
.
B
f
"
H
m
v
O
s
L
o
T
g
Z
I
k
E
A
4
d
B
\194
m
7
k
Q
U
8
K
6
b
.
r
C
T
5
f
$
\195
-
J
c
P
"
A
4
w
x
M
'
i
1
_
D
E
2
L
0
z
Non-zero for 11.7% of words.
^50m$ ____ (3.8) ^23rd $____ (3.6) ^Grad e$____ (3.6) ^Zard ari$____ (3.4) ^2007 $____ (3.2) ^2007 .$____ (3.2) ^Liz$ ____ (3.2) ^Kim$ ____ (3.1) ^1978$ ____ (3.1) ^270$ ____ (3.1) ^210$ ____ (3.1) ^LSU$ ____ (3.1) ^TOKYO$ ____ (3.1) ^10-K$ ____ (3.0) ^500$ ____ (3.0) ^1,500$ ____ (3.0) ^2,500$ ____ (3.0) ^3,500$ ____ (3.0) ^GOP$ ____ (3.0) ^Lib$ ____ (3.0) ^2008 $____ (2.9) ^2008 .$____ (2.9) ^2006 $____ (2.9) ^10m$ ____ (2.8) ^2005 $____ (2.8) ^600$ ____ (2.8) ^1,600$ ____ (2.8) ^300$ ____ (2.8) ^1,300$ ____ (2.8) ^Kurd ish$____ (2.8) ^Kurd s$____ (2.8) ^Kurd istan$____ (2.8) ^800$ ____ (2.8) ^1,800$ ____ (2.8) ^1987 $____ (2.8) ^50,0 00$____ (2.7) ^250,0 00$____ (2.7) ^150,0 00$____ (2.7) ^350,0 00$____ (2.7) ^Brad $____ (2.7) Filter 44 (bias = -0.48) #
<BOS>
D
-
P
w
F
.
y
W
L
'
C
Q
1
\194
h
v
p
"
U
^
H
8
0
d
c
K
A
B
T
,
-
f
Y
F
Q
B
:
y
\194
L
"
c
W
K
X
x
(
A
&
D
!
U
V
a
'
p
%
P
n
e
h
m
N
P
.
K
-
U
h
,
v
2
g
1
u
I
j
p
b
C
x
i
r
Z
t
3
q
5
'
6
o
n
e
/
c
X
T
8
Y
4
"
B
\194
Y
f
H
d
h
.
g
s
V
z
4
m
i
U
1
c
7
t
3
F
R
v
J
y
j
e
9
K
0
a
Z
P
X
p
5
D
k
\163
6
x
Non-zero for 3.3% of words.
^al-Qai da$____ (2.6) ^low-in come$____ (2.3) ^Yah oo$____(1.7) ^--$_ ___ (1.6) ^---$_ ___ (1.6) ^al-Qae da$____ (1.4) ^Al-Qae da$____ (1.4) ^Wig an$____(1.3) ^following $____ (1.2) ^growing $____ (1.2) ^showing $____ (1.2) ^allowing $____ (1.2) ^drawing $____ (1.2) ^wing $____ (1.2) ^Wii $____(1.2) ^double-dig it$____ (1.2) ^low-co st$____ (1.2) ^YOR K$____(1.2) ^go-ah ead$____ (1.2) ^Jewish $____ (1.1) ^wish $____ (1.1) ^wish es$____ (1.1) ^wish ed$____ (1.1) ^wish ing$____ (1.1) ^QC$ ____(1.1) ^award-win ning$____ (1.1) ^Oscar-win ning$____ (1.1) ^iPh one$____(1.1) ^iPh ones$____(1.1) ^sub-pr ime$____ (1.0) ^WITH $____ (1.0) ^swung $____ (1.0) ^low-ke y$____ (1.0) ^Rih anna$____(1.0) ^built-in $____ (1.0) ^late-ni ght$____ (0.9) ^Wag ner$____(0.9) ^performance-enh ancing$____ (0.9) ^mid-199 0s$____ (0.9) ^run-up$ ____ (0.9) Filter 45 (bias = -0.49) #
c
l
g
L
v
<BOS>
G
J
W
f
k
N
C
r
h
P
i
\195
y
F
p
j
w
B
s
K
T
/
z
e
Y
R
x
I
'
X
1
b
"
D
K
g
W
k
N
d
X
s
o
F
/
h
9
j
&
u
O
t
M
b
3
p
Q
r
6
A
(
)
5
v
\194
m
8
e
"
c
Z
\163
Y
.
O
n
G
u
K
m
"
j
W
F
p
t
Q
f
E
-
8
.
Y
H
o
l
z
M
X
s
3
k
9
'
0
A
2
h
\163
C
c
v
1
i
x
-
Q
d
Y
u
"
I
V
t
b
E
W
D
X
j
8
J
'
e
h
i
\194
U
$
T
Z
P
9
s
c
l
K
z
S
_
N
w
B
2
Non-zero for 12.3% of words.
^cop$ ____ (3.3) ^copy $____ (3.1) ^copy right$____ (3.1) ^cook ing$____ (2.8) ^cook $____ (2.8) ^cook ed$____ (2.8) ^cook ies$____ (2.8) ^TOKY O$____ (2.7) ^copp er$____ (2.6) ^coor dinator$____ (2.4) ^coor dinated$____ (2.4) ^coor dination$____ (2.4) ^coor dinate$____ (2.4) ^cock tail$____ (2.4) ^cock pit$____ (2.4) ^NO$ ____(2.4) ^hoax $____ (2.4) ^coac h$____ (2.4) ^coac hes$____ (2.4) ^coac hing$____ (2.4) ^coac hed$____ (2.4) ^coop eration$____ (2.4) ^coop erate$____ (2.4) ^coop erating$____ (2.4) ^coop erative$____ (2.4) ^soph isticated$____ (2.3) ^philosoph y$____ (2.3) ^soph omore$____ (2.3) ^philosoph ical$____ (2.3) ^shop$ ____ (2.3) ^Bishop$ ____ (2.3) ^Archbishop$ ____ (2.3) ^hip-hop$ ____ (2.3) ^bishop$ ____ (2.3) ^tycoon $____ (2.3) ^Kob e$____(2.3) ^pop$ ____ (2.2) ^MOSCOW$ ____ (2.2) ^Soph ie$____ (2.1) ^Christoph er$____ (2.1) Filter 46 (bias = -0.49) #
F
p
R
m
Q
G
C
w
<BOS>
o
9
y
N
g
I
z
7
i
u
K
k
O
\194
l
j
W
'
-
8
T
"
L
s
a
n
x
H
b
r
c
W
F
v
j
z
r
w
f
K
H
"
n
B
h
\194
P
X
l
&
C
Y
d
9
y
T
p
0
g
E
\$
G
t
o
A
Q
s
2
i
c
L
S
r
4
q
V
u
s
v
5
T
C
\195
G
a
F
.
6
o
3
B
7
b
$
N
Q
l
8
-
"
A
2
t
W
D
Z
z
i
J
w
.
f
w
P
b
t
Z
p
a
D
W
j
g
F
z
T
V
,
x
r
E
O
G
C
A
o
Y
M
Q
l
s
y
X
S
2
i
U
d
q
k
Non-zero for 12.3% of words.
^vs. $____(3.2) ^FTSE $____ (2.8) ^vib rant$____(2.5) ^RBS$ ____ (2.5) ^Wig an$____(2.3) ^via $____(2.2) ^via ble$____(2.2) ^via bility$____(2.2) ^JERUSA LEM$____ (2.2) ^Web $____(2.2) ^Web b$____(2.2) ^Web ber$____(2.2) ^Web ster$____(2.2) ^Web er$____(2.2) ^Latvia $____ (2.2) ^Bros. $____ (2.2) ^FOXNews. com$____ (2.2) ^PRNewsw ire$____ (2.1) ^PRNewsw ire-FirstCall$____ (2.1) ^PRNewsw ire-USNewswire$____ (2.1) ^NYSE $____ (2.1) ^CBS$ ____ (2.1) ^Wiz ards$____(2.1) ^vig orously$____(2.1) ^vig orous$____(2.1) ^Avia tion$____ (2.0) ^Wea ther$____(1.9) ^Crosb y$____ (1.9) ^Roya l$____ (1.9) ^Roya ls$____ (1.9) ^Kib aki$____(1.9) ^Ross $____ (1.9) ^Ross i$____ (1.9) ^Wha t$____(1.8) ^Wha tever$____(1.8) ^Bib le$____(1.8) ^Nasa $____ (1.8) ^U.S. $____ (1.7) ^U.S. -led$____ (1.7) ^U.S. -backed$____ (1.7) Filter 47 (bias = -0.59) #
Q
f
<BOS>
i
X
t
.
k
Z
y
G
F
7
,
\194
s
Y
p
z
B
0
e
8
U
V
S
^
m
g
h
<EOS>
u
L
T
9
E
/
H
b
M
x
H
v
u
p
L
c
A
'
D
o
U
K
d
k
I
W
E
f
F
V
r
\194
e
S
1
G
j
z
(
"
h
b
%
B
l
9
.
Y
\$
Q
x
R
-
I
m
r
f
X
v
P
.
7
o
Z
h
C
l
9
n
E
s
"
w
8
i
T
d
2
u
3
y
U
p
G
a
O
'
D
g
d
o
-
S
Z
h
X
f
D
O
v
y
.
,
z
B
I
t
2
i
6
k
Q
x
P
N
7
K
J
r
u
Y
b
A
V
p
E
W
\163
R
Non-zero for 18.5% of words.
^Gord on$____ (3.6) ^ord er$____(3.2) ^ord ered$____(3.2) ^ord ers$____(3.2) ^ord inary$____(3.2) ^Lord $____ (3.2) ^Lord s$____ (3.2) ^bord er$____ (3.1) ^bord ers$____ (3.1) ^cross-bord er$____ (3.1) ^bord ering$____ (3.1) ^landlord $____ (3.0) ^landlord s$____ (3.0) ^accord ing$____ (2.7) ^record $____ (2.7) ^Accord ing$____ (2.7) ^record s$____ (2.7) ^Jord an$____ (2.7) ^Ord er$____(2.4) ^XVI$ ____ (2.3) ^Gore $____ (2.3) ^pru dent$____(2.2) ^involved $____ (2.2) ^resolved $____ (2.2) ^evolved $____ (2.2) ^solved $____ (2.2) ^pre sident$____(2.1) ^pre sidential$____(2.1) ^pre vious$____(2.1) ^pre ssure$____(2.1) ^pre ss$____(2.1) ^pre viously$____(2.1) ^pre vent$____(2.1) ^pre sent$____(2.1) ^extraord inary$____ (2.1) ^extraord inarily$____ (2.1) ^cru de$____(2.1) ^cru cial$____(2.1) ^cru nch$____(2.1) ^cru ise$____(2.1) Filter 48 (bias = -0.80) #
.
v
'
J
Q
B
/
T
C
E
s
0
r
i
<BOS>
6
y
9
A
e
c
4
m
2
n
1
O
u
h
H
3
q
j
k
X
s
Z
x
J
f
D
.
T
S
P
t
0
h
M
-
1
'
K
a
9
\$
)
F
Y
g
6
d
7
o
H
y
2
w
8
u
G
p
\195
A
d
Z
f
A
'
J
x
L
v
N
p
G
s
K
-
\195
t
Y
\194
r
k
H
,
E
m
B
y
X
i
3
"
M
C
g
S
0
c
R
F
w
v
L
q
l
W
g
B
m
N
G
o
A
k
D
9
s
x
F
"
j
R
d
w
U
\194
Z
n
P
-
S
f
z
'
u
T
r
Non-zero for 16.6% of words.
^Xav ier$____(2.7) ^Tsv angirai$____(2.6) ^N.J.$ ____ (2.5) ^Xin hua$____(2.5) ^Xin jiang$____(2.5) ^D-N .Y.$____(2.4) ^CDs$ ____ (2.4) ^Juv entus$____(2.4) ^Jav ier$____(2.3) ^Div ision$____(2.2) ^Ph.D.$ ____ (2.2) ^Xbo x$____(2.1) ^Jak e$____(2.1) ^Jak arta$____(2.1) ^DAX$_ ___ (2.1) ^Jin tao$____(2.1) ^Dav id$____(2.1) ^Dav is$____(2.1) ^Dav e$____(2.1) ^Dav ies$____(2.1) ^Dav ydenko$____(2.1) ^Dav idson$____(2.1) ^FOX$_ ___ (2.0) ^Md$ ____(2.0) ^Zoo $____(2.0) ^Pfi zer$____(2.0) ^X$_ ___(1.9) ^Jun e$____(1.9) ^Jun ior$____(1.9) ^TSB $____(1.9) ^Dix on$____(1.9) ^Zur ich$____(1.9) ^CDC$ ____ (1.9) ^'Don nell$____ (1.9) ^Jan uary$____(1.9) ^Jan $____(1.9) ^Jan e$____(1.9) ^Jan et$____(1.9) ^Zar dari$____(1.9) ^Duk e$____(1.9) Filter 49 (bias = -0.56) #
<BOS>
y
E
r
2
h
w
m
W
f
v
F
9
A
z
p
6
t
-
l
J
H
X
L
4
n
5
C
\194
k
I
P
0
c
Q
D
3
,
"
T
x
k
L
U
5
R
6
m
h
'
l
C
7
;
\$
u
e
s
8
r
q
T
0
z
.
P
N
t
4
i
F
M
X
O
9
)
2
-
D
w
b
t
V
I
m
D
Y
r
.
P
X
j
x
d
Z
u
W
1
G
F
Q
R
"
T
\194
,
'
f
z
N
$
o
g
n
L
i
w
H
e
w
d
W
P
E
D
I
x
2
f
O
F
A
r
i
p
S
v
t
l
4
L
s
h
3
c
$
b
u
C
R
n
y
m
Non-zero for 19.7% of words.
^DENVE R$____ (2.8) ^webs ite$____ (2.3) ^webs ites$____ (2.3) ^Webs ter$____ (2.3) ^web$ ____ (2.2) ^Web$ ____ (2.2) ^Webe r$____ (2.1) ^Zimbabw e$____ (2.0) ^Zimbabw ean$____ (2.0) ^Exxo n$____ (1.9) ^Law $____(1.8) ^Law rence$____(1.8) ^Law yers$____(1.8) ^Law makers$____(1.8) ^moveme nt$____ (1.7) ^improveme nt$____ (1.7) ^involveme nt$____ (1.7) ^improveme nts$____ (1.7) ^FedEx$_ ___ (1.7) ^vomi ting$____ (1.6) ^L.A .$____(1.6) ^Elvi s$____ (1.6) ^law $____(1.6) ^law yer$____(1.6) ^law s$____(1.6) ^law makers$____(1.6) ^Kashmi r$____ (1.6) ^emi ssions$____(1.6) ^emi ssion$____(1.6) ^XVI $____(1.6) ^5.2 $____(1.5) ^6.2 $____(1.5) ^Jami e$____ (1.5) ^Peshaw ar$____ (1.5) ^Norwegi an$____ (1.5) ^50m$ ____ (1.5) ^Jobs $____ (1.5) ^Elizabe th$____ (1.5) ^100m$ ____ (1.5) ^200m$ ____ (1.5) Filter 50 (bias = -0.51) #
B
-
Q
g
k
d
U
v
C
w
V
e
K
o
R
E
A
j
Z
i
P
u
Y
p
<BOS>
.
F
0
/
x
b
h
X
t
,
D
r
c
"
2
B
y
v
O
b
p
9
S
\194
E
V
e
k
g
n
o
R
G
q
%
Z
f
)
i
Q
r
C
m
X
-
7
s
6
t
a
j
z
M
&
h
l
u
X
h
Q
y
b
H
K
c
p
U
P
1
B
R
e
g
z
C
f
n
\194
s
V
-
m
i
L
o
I
3
S
4
O
k
6
d
/
M
K
g
"
j
Q
d
f
v
'
l
O
T
N
D
/
0
S
h
s
p
U
i
$
q
W
H
M
1
R
t
,
e
o
I
y
J
A
n
Non-zero for 22.6% of words.
^Blo omberg$____(3.0) ^Blo od$____(3.0) ^Cal$ ____ (2.8) ^UBS$ ____ (2.7) ^CBI$ ____ (2.5) ^blo od$____(2.5) ^blo ck$____(2.5) ^blo w$____(2.5) ^blo g$____(2.5) ^Baby $____ (2.5) ^CBS$ ____ (2.5) ^RBIs $____ (2.5) ^RBI$ ____ (2.5) ^RBS$ ____ (2.4) ^global$ ____ (2.4) ^Global$ ____ (2.4) ^tribal$ ____ (2.4) ^verbal$ ____ (2.4) ^Abbo tt$____ (2.4) ^Bef ore$____(2.4) ^Half $____ (2.4) ^Buff alo$____ (2.3) ^Buff ett$____ (2.3) ^liberals $____ (2.3) ^generals $____ (2.3) ^several$ ____ (2.3) ^federal$ ____ (2.3) ^general$ ____ (2.3) ^central$ ____ (2.3) ^weaknes s$____ (2.3) ^darknes s$____ (2.3) ^weaknes ses$____ (2.3) ^PBS$ ____ (2.3) ^BlackBer ry$____ (2.2) ^Kabul$ ____ (2.2) ^Istanbul$ ____ (2.2) ^FBI$ ____ (2.2) ^tablo id$____ (2.2) ^Pablo $____ (2.2) ^reserves $____ (2.2) Filter 51 (bias = -0.55) #
K
u
Q
v
G
-
X
w
<BOS>
t
O
n
P
h
S
q
L
k
Y
.
l
H
/
g
8
T
p
i
7
c
"
e
V
j
6
a
^
N
z
d
R
l
u
m
U
L
s
p
k
X
r
D
-
.
I
K
E
e
9
x
)
t
"
c
3
f
?
M
H
5
\195
v
C
T
Q
6
(
0
'
j
w
F
W
f
G
v
g
x
O
k
X
B
y
u
Z
R
Q
j
Y
s
/
P
K
9
2
d
$
J
H
l
m
b
1
t
C
h
U
I
h
Q
y
q
c
X
G
t
g
a
u
.
s
A
x
W
M
l
C
N
m
w
i
/
f
\194
F
o
k
U
j
4
0
Non-zero for 15.1% of words.
^Kuwa it$____ (3.7) ^TORON TO$____ (3.0) ^Rwa nda$____(3.0) ^Suga r$____ (2.9) ^Guy$ ____ (2.7) ^Kuma r$____ (2.6) ^Rya n$____(2.5) ^Rya nair$____(2.5) ^Orga nization$____ (2.4) ^Orga nisation$____ (2.4) ^ugl y$____(2.4) ^swa p$____(2.4) ^swa ps$____(2.4) ^swa y$____(2.4) ^plug$ ____ (2.3) ^Uga nda$____(2.2) ^Quit e$____ (2.2) ^Quot e$____ (2.1) ^YORK$ ____ (2.0) ^Guil d$____ (1.9) ^swe et$____(1.9) ^swe pt$____(1.9) ^swe eping$____(1.9) ^swe ep$____(1.9) ^Guat emala$____ (1.8) ^Kurt $____ (1.8) ^Delawa re$____ (1.8) ^DETROI T$____ (1.8) ^Sky$ ____ (1.8) ^TOKYO$ ____ (1.7) ^lawl ess$____ (1.7) ^suga r$____ (1.7) ^MOSCOW $____ (1.7) ^VEGA S$____ (1.7) ^Qual ity$____ (1.6) ^LPGA $____ (1.6) ^Ryd er$____(1.6) ^awa y$____(1.6) ^awa rd$____(1.6) ^awa re$____(1.6) Filter 52 (bias = -0.41) #
X
x
Q
v
Z
s
E
n
e
k
r
i
H
o
y
C
L
f
M
'
"
t
D
R
\163
S
<BOS>
B
O
,
G
z
.
-
N
l
2
c
P
u
D
k
(
b
I
p
X
m
2
x
N
s
5
f
6
r
/
'
7
U
\194
g
1
G
H
R
\$
P
&
V
t
y
L
)
Q
h
q
c
W
;
U
-
a
j
K
o
Q
g
,
u
A
v
8
J
W
M
B
l
P
.
y
t
2
r
C
h
Z
e
X
T
6
R
L
n
F
'
z
w
/
D
l
c
X
d
Y
u
L
r
V
y
6
C
H
v
W
g
7
D
\194
-
4
k
5
U
S
s
J
\163
/
t
B
p
Q
z
b
T
3
R
o
Non-zero for 16.9% of words.
^Dal las$____(2.5) ^Dal ai$____(2.5) ^Dal e$____(2.5) ^Dal y$____(2.5) ^metal $____ (2.4) ^metal s$____ (2.4) ^retal iation$____ (2.4) ^mortal ity$____ (2.4) ^portal $____ (2.4) ^Zeal and$____ (2.4) ^real ly$____ (2.3) ^real $____ (2.3) ^real ity$____ (2.3) ^real ize$____ (2.3) ^Heal th$____ (2.3) ^Heal thcare$____ (2.3) ^medal $____ (2.2) ^medal s$____ (2.2) ^Medal $____ (2.2) ^pedal $____ (2.2) ^medal ist$____ (2.2) ^penal ty$____ (2.2) ^Arsenal $____ (2.2) ^penal ties$____ (2.2) ^arsenal $____ (2.2) ^Journal $____ (2.2) ^journal ists$____ (2.2) ^internal $____ (2.2) ^journal ist$____ (2.2) ^renewal $____ (2.1) ^sidewal k$____ (2.1) ^Deal $____ (2.1) ^DAX $____(2.1) ^Hal l$____(2.1) ^Hal f$____(2.1) ^Hal loween$____(2.1) ^Hal ifax$____(2.1) ^tal ks$____(2.0) ^tal k$____(2.0) ^tal king$____(2.0) Filter 53 (bias = -0.53) #
.
k
Z
B
/
f
L
T
G
v
g
t
X
P
Q
i
m
E
<EOS>
p
^
F
Y
e
<BOS>
u
A
r
n
h
V
q
I
R
x
J
;
g
P
j
W
.
K
-
B
c
U
h
p
D
k
o
a
n
Q
u
X
0
,
l
f
?
"
G
b
(
i
d
:
t
I
C
V
A
6
s
H
-
V
o
T
f
M
O
Z
p
m
r
X
s
Y
I
B
w
h
n
k
l
F
R
b
N
L
z
6
d
U
x
4
,
\194
\195
"
J
W
'
Q
w
p
u
r
H
R
U
'
i
"
m
P
A
b
t
x
L
G
M
S
y
V
E
7
n
C
W
X
a
9
h
Y
_
k
B
\194
1
8
2
Non-zero for 17.4% of words.
^Lamp ard$____ (2.9) ^BHP $____(2.6) ^AZUZ$ ____ (2.6) ^PHILADELPHI A$____ (2.5) ^Lamb ert$____ (2.5) ^amp le$____(2.4) ^Zimb abwe$____ (2.4) ^Zimb abwean$____ (2.4) ^Gabr iel$____ (2.3) ^phr ase$____(2.3) ^phr ases$____(2.3) ^PM$ ____(2.3) ^WHO $____(2.3) ^PBS $____(2.3) ^gamb ling$____ (2.3) ^gamb le$____ (2.3) ^Kyr gyzstan$____(2.2) ^WTO $____(2.2) ^Byr d$____(2.1) ^Byr ne$____(2.1) ^Byr on$____(2.1) ^PAR IS$____(2.1) ^Ukr aine$____(2.1) ^Ukr ainian$____(2.1) ^Lavr ov$____ (2.1) ^BT$ ____(2.1) ^Camp bell$____ (2.1) ^Camp $____ (2.1) ^Camp aign$____ (2.1) ^Limb augh$____ (2.1) ^imp ortant$____(2.0) ^imp act$____(2.0) ^imp rove$____(2.0) ^imp roved$____(2.0) ^Libr ary$____ (2.0) ^BBC $____(2.0) ^amb assador$____(2.0) ^amb itious$____(2.0) ^amb ulance$____(2.0) ^amb itions$____(2.0) Filter 54 (bias = -0.55) #
G
.
Y
q
K
u
O
h
P
e
z
v
C
N
V
r
/
-
S
x
l
a
i
w
X
b
m
E
,
n
U
F
^
f
5
\163
p
g
\194
d
!
T
Z
o
.
r
L
R
A
O
m
j
X
E
V
-
n
t
/
u
w
S
?
h
a
p
Q
k
:
v
&
;
6
f
b
J
5
e
K
i
t
u
S
L
I
b
p
U
Q
.
W
J
,
\195
X
-
O
v
i
o
C
x
$
N
\194
r
V
a
R
h
Z
B
m
E
a
-
B
j
A
g
U
c
L
n
l
M
m
u
z
R
b
o
K
C
P
h
X
'
,
0
W
4
\195
1
/
3
T
r
Q
9
p
7
k
d
Non-zero for 26.4% of words.
^Onta rio$____ (2.8) ^Zim babwe$____(2.7) ^Zim babwean$____(2.7) ^mainta in$____ (2.5) ^mainta ined$____ (2.5) ^mainta ining$____ (2.5) ^mainta ins$____ (2.5) ^impa ct$____ (2.5) ^impa irment$____ (2.5) ^impa cts$____ (2.5) ^impa cted$____ (2.5) ^impa sse$____ (2.5) ^impa ired$____ (2.5) ^LSU $____(2.4) ^Apa rt$____(2.4) ^Atl anta$____(2.3) ^Atl antic$____(2.3) ^Atl antis$____(2.3) ^jointl y$____ (2.3) ^appointm ent$____ (2.3) ^disappointm ent$____ (2.3) ^appointm ents$____ (2.3) ^Lia m$____(2.2) ^simpl y$____ (2.2) ^simpl e$____ (2.2) ^impl ement$____ (2.2) ^impl ications$____ (2.2) ^Virginia $____ (2.2) ^Palestinia n$____ (2.2) ^Palestinia ns$____ (2.2) ^Ukrainia n$____ (2.2) ^Israeli-Palestinia n$____ (2.2) ^Via com$____(2.1) ^Zea land$____(2.1) ^simil ar$____ (2.0) ^simil arly$____ (2.0) ^Simil arly$____ (2.0) ^Simil ar$____ (2.0) ^AOL $____(2.0) ^Lil ly$____(2.0) Filter 55 (bias = -0.43) #
x
w
8
-
9
g
6
t
\194
r
7
H
Q
u
F
m
"
A
X
i
v
n
5
y
0
.
V
o
<BOS>
I
B
O
S
M
4
T
b
\195
P
E
.
U
(
P
j
k
-
m
\$
B
N
p
?
b
g
;
S
z
o
)
n
T
5
K
\194
%
:
a
t
y
w
\195
&
f
h
i
Q
u
/
v
W
f
w
F
Q
j
z
h
X
l
"
n
G
u
$
P
E
d
O
r
K
D
Y
H
2
L
Z
p
\194
y
x
i
C
m
t
p
N
g
B
G
f
i
u
X
F
1
R
V
v
P
t
-
U
m
o
d
.
Y
9
2
s
H
M
l
,
y
\194
O
x
7
c
Z
k
W
K
Non-zero for 14.2% of words.
^ex-wi fe$____ (4.5) ^2008.$_ ___ (3.1) ^vowi ng$____ (3.1) ^6.2$ ____ (2.9) ^2009.$_ ___ (2.9) ^7.2$ ____ (2.7) ^2007.$_ ___ (2.7) ^award-wi nning$____ (2.7) ^8.5$ ____ (2.4) ^Swi ss$____(2.4) ^Swi tzerland$____(2.4) ^Swi ft$____(2.4) ^Q.$_ ___ (2.4) ^6-2$ ____ (2.3) ^6.8$ ____ (2.3) ^F.$_ ___ (2.3) ^owi ng$____(2.3) ^vowe d$____ (2.3) ^6.3$ ____ (2.3) ^Bowl $____ (2.3) ^5.2$ ____ (2.2) ^9.5$ ____ (2.2) ^6.6$ ____ (2.2) ^next$_ ___ (2.2) ^text$_ ___ (2.2) ^Next$_ ___ (2.2) ^context$_ ___ (2.2) ^6.5$ ____ (2.2) ^Gov.$_ ___ (2.2) ^Rev.$_ ___ (2.2) ^v.$_ ___ (2.2) ^0.2$ ____ (2.2) ^6.4$ ____ (2.2) ^twi ce$____(2.1) ^twi n$____(2.1) ^twi ns$____(2.1) ^twi st$____(2.1) ^85$_ ___ (2.1) ^1985$_ ___ (2.1) ^Vauxhal l$____ (2.1) Filter 56 (bias = -0.50) #
K
v
o
j
/
T
<EOS>
k
W
d
3
t
O
p
L
e
U
q
,
F
Z
g
5
b
w
r
N
D
s
P
Y
\163
G
-
S
c
y
h
4
I
W
-
X
f
Q
n
E
l
2
p
"
j
(
d
w
'
K
s
4
C
N
P
Z
m
?
u
B
r
&
x
8
R
6
o
3
k
q
g
A
%
Q
m
d
M
7
w
8
i
\194
f
X
u
D
o
"
k
6
-
9
n
I
H
0
B
\163
h
2
J
C
t
p
g
q
j
1
y
$
s
a
K
y
-
c
u
K
J
p
l
C
j
G
v
8
Y
F
b
O
R
,
H
Z
i
Q
\195
f
T
P
q
D
h
"
k
\163
B
/
I
U
t
2
E
Non-zero for 16.0% of words.
^Way ne$____(3.6) ^Way $____(3.6) ^Wac hovia$____(3.4) ^Lady $____ (3.1) ^broadc ast$____ (3.1) ^broadc aster$____ (3.1) ^broadc asting$____ (3.1) ^broadc asters$____ (3.1) ^Galloway $____ (2.7) ^Alloway $____ (2.7) ^Eac h$____(2.6) ^Kennedy $____ (2.5) ^Why $____(2.5) ^sway $____ (2.4) ^Exc hange$____(2.4) ^Exc luding$____(2.4) ^Exc ept$____(2.4) ^Exc ellence$____(2.4) ^anyway $____ (2.3) ^Anyway $____ (2.3) ^comedy $____ (2.3) ^Comedy $____ (2.3) ^remedy $____ (2.3) ^Radc liffe$____ (2.3) ^slowly $____ (2.2) ^narrowly $____ (2.2) ^lady $____ (2.2) ^way $____(2.2) ^way s$____(2.2) ^Edd ie$____(2.2) ^Kay $____(2.2) ^away $____ (2.1) ^breakaway $____ (2.1) ^runaway $____ (2.1) ^Hathaway $____ (2.1) ^EDF $____(2.0) ^Bay $____(2.0) ^Bay ern$____(2.0) ^Bay lor$____(2.0) ^runway $____ (2.0) Filter 57 (bias = -0.47) #
H
c
L
p
Y
k
A
f
X
v
/
x
l
s
Z
C
M
'
6
F
W
r
u
P
1
\163
m
d
J
z
^
S
h
g
4
R
3
t
N
b
k
E
C
L
V
.
n
-
i
%
'
e
\194
o
B
D
Q
u
Y
d
W
l
)
O
I
y
,
N
;
r
7
J
p
G
X
\195
H
f
4
w
U
q
s
n
m
p
Y
d
O
v
z
c
K
F
G
C
L
h
E
0
u
x
M
1
R
D
S
j
"
t
o
7
/
I
$
e
-
g
2
Q
h
R
y
Z
u
r
m
'
H
X
L
$
d
9
i
I
t
"
T
V
x
N
l
G
v
K
F
C
D
O
e
/
a
A
f
_
Non-zero for 16.2% of words.
^His$ ____ (3.6) ^ACOR N$____ (3.5) ^talks$ ____ (3.4) ^folks$ ____ (3.4) ^walks$ ____ (3.4) ^Talks$ ____ (3.4) ^Volksw agen$____ (3.2) ^HBO$ ____ (3.1) ^Liz$ ____ (3.1) ^LSU$ ____ (3.0) ^Liu$ ____ (2.8) ^km$ ____(2.7) ^Lisb on$____ (2.6) ^Hisp anic$____ (2.6) ^Hisp anics$____ (2.6) ^Limb augh$____ (2.6) ^Libr ary$____ (2.5) ^Has$ ____ (2.5) ^Lib$ ____ (2.4) ^Ham$ ____ (2.4) ^Israelis$ ____ (2.4) ^Indianapolis$ ____ (2.4) ^Ellis$ ____ (2.4) ^Minneapolis$ ____ (2.4) ^LLC$_ ___ (2.3) ^PLC$_ ___ (2.3) ^Las$ ____ (2.3) ^Lion s$____ (2.3) ^Lion el$____ (2.3) ^Lion $____ (2.3) ^AZUZ $____ (2.3) ^Muslim$ ____ (2.3) ^slim$ ____ (2.3) ^LCD$ ____ (2.2) ^HBOS $____ (2.2) ^HIV$ ____ (2.2) ^Falkir k$____ (2.1) ^breaks$ ____ (2.1) ^speaks$ ____ (2.1) ^outbreaks$ ____ (2.1) Filter 58 (bias = -0.55) #
<BOS>
t
Q
f
X
y
V
A
Y
a
Z
o
\194
h
"
p
^
i
7
s
9
w
R
e
8
E
6
c
'
x
F
d
,
g
S
n
U
h
E
H
P
i
F
q
z
-
s
1
T
o
k
Y
c
W
r
x
\163
5
G
7
)
4
f
l
K
(
D
:
e
6
O
3
;
\194
b
V
w
b
u
Q
E
Y
t
k
e
X
o
'
N
"
H
p
y
P
i
G
n
\194
A
C
f
R
2
z
-
$
h
7
s
x
1
v
3
9
,
C
E
c
M
n
m
d
-
1
J
D
e
A
b
p
w
7
u
8
f
5
T
,
B
0
j
x
W
/
U
.
v
a
k
y
i
h
K
g
\195
Non-zero for 18.9% of words.
^1bn $____(3.6) ^NYC $____(3.3) ^IPC C$____(2.5) ^unkn own$____ (2.5) ^orc hestra$____(2.4) ^orc hestrated$____(2.4) ^ICC $____(2.3) ^Zhan g$____ (2.3) ^occ urred$____(2.3) ^occ ur$____(2.3) ^occ asion$____(2.3) ^occ upied$____(2.3) ^occ asionally$____(2.3) ^occ asions$____(2.3) ^MPC $____(2.3) ^Viac om$____ (2.2) ^opp osition$____(2.2) ^opp ortunity$____(2.2) ^opp ortunities$____(2.2) ^opp osed$____(2.2) ^opp onents$____(2.2) ^opp osite$____(2.2) ^opp onent$____(2.2) ^opp ose$____(2.2) ^Abd ullah$____(2.2) ^Abd ul$____(2.2) ^Abd ulmutallab$____(2.2) ^Vikt or$____ (2.1) ^ign ored$____(2.1) ^ign ore$____(2.1) ^ign oring$____(2.1) ^ign orance$____(2.1) ^ign orant$____(2.1) ^ign ited$____(2.1) ^obl igations$____(2.1) ^obl igation$____(2.1) ^obl iged$____(2.1) ^hac kers$____(2.1) ^hac king$____(2.1) ^hac ked$____(2.1) Filter 59 (bias = -0.53) #
<BOS>
d
O
P
o
p
N
b
W
m
S
k
w
a
Q
U
"
F
M
v
E
x
^
L
3
l
K
u
/
C
Y
f
h
n
D
B
U
j
y
k
!
f
L
-
Z
t
z
v
a
r
G
J
&
S
K
h
?
l
/
b
1
e
D
F
2
x
A
i
8
;
c
q
d
R
Q
'
m
o
b
p
V
c
w
x
E
C
T
n
M
R
X
8
U
y
H
r
A
1
t
9
.
N
Z
,
e
O
B
0
W
d
k
3
z
f
v
7
Q
T
'
D
"
y
V
h
-
c
$
A
\194
B
S
1
X
o
b
t
R
u
H
w
_
i
L
a
0
n
U
Non-zero for 26.3% of words.
^SUV$ ____ (3.2) ^SEOUL$ ____ (3.1) ^Sam$ ____ (2.9) ^Sams ung$____ (2.6) ^Symp hony$____ (2.5) ^unemployme nt$____ (2.4) ^employme nt$____ (2.4) ^deployme nt$____ (2.4) ^Employme nt$____ (2.4) ^am$ ____(2.4) ^NASDAQ $____ (2.4) ^Nyme x$____ (2.4) ^U.S .$____(2.3) ^U.S .-led$____(2.3) ^U.S .-backed$____(2.3) ^UBS $____(2.3) ^amb assador$____(2.3) ^amb itious$____(2.3) ^amb ulance$____(2.3) ^amb itions$____(2.3) ^amb ition$____(2.3) ^amb ush$____(2.3) ^amb assadors$____(2.3) ^Moyes $____ (2.2) ^Oak$ ____ (2.2) ^1m$ ____(2.2) ^2m$ ____(2.1) ^Am$ ____(2.1) ^boys$ ____ (2.1) ^toys$ ____ (2.1) ^employs$ ____ (2.1) ^Cowboys$ ____ (2.1) ^enjoys$ ____ (2.1) ^Boys$ ____ (2.1) ^boat$ ____ (2.1) ^throat$ ____ (2.1) ^coat$ ____ (2.1) ^float$ ____ (2.1) ^employer s$____ (2.1) ^employer $____ (2.1) Filter 60 (bias = -0.66) #
R
y
<BOS>
e
C
h
V
m
9
T
Q
W
'
t
n
E
I
p
7
w
k
O
J
o
<EOS>
\163
Z
L
\194
c
s
a
-
g
P
i
\195
D
r
q
x
-
F
T
8
u
Q
o
\$
z
9
J
7
%
B
O
6
g
q
t
f
l
b
i
"
j
V
m
X
D
h
E
a
w
W
G
4
U
!
\195
H
f
I
x
-
S
1
s
u
F
Z
K
C
o
R
B
d
E
D
e
T
L
n
h
P
O
g
,
X
N
\195
y
q
5
7
b
Y
l
Q
m
S
-
F
J
M
g
t
\195
/
q
Q
p
f
v
m
b
'
u
,
a
C
1
\194
0
$
d
K
r
"
i
s
w
y
o
E
2
x
Non-zero for 19.8% of words.
^RBI$ ____ (3.2) ^RBIs $____ (3.1) ^FIF A$____(2.5) ^Reut ers$____ (2.5) ^CBI$ ____ (2.4) ^Radc liffe$____ (2.4) ^1981$ ____ (2.3) ^Ranc h$____ (2.3) ^FTS E$____(2.2) ^Cant erbury$____ (2.2) ^Red$ ____ (2.1) ^Fut ure$____(2.1) ^Fut ures$____(2.1) ^1991$ ____ (2.0) ^Reds kins$____ (2.0) ^Reds $____ (2.0) ^1971$ ____ (2.0) ^1961$ ____ (1.9) ^Cart er$____ (1.9) ^McCart hy$____ (1.9) ^McCart ney$____ (1.9) ^astronaut s$____ (1.9) ^astronaut $____ (1.9) ^unaut horized$____ (1.9) ^Can$ ____ (1.9) ^NHS $____(1.9) ^F1$ ____(1.9) ^1941$ ____ (1.9) ^1987$ ____ (1.8) ^Raul $____ (1.8) ^Cany on$____ (1.8) ^BERLIN $____ (1.8) ^Chuc k$____ (1.8) ^Cauc asus$____ (1.8) ^1951$ ____ (1.7) ^FCC $____(1.7) ^1982$ ____ (1.7) ^BCS $____(1.7) ^FC$ ____(1.7) ^81$ ____(1.7) Filter 61 (bias = -0.59) #
<BOS>
u
X
h
V
o
Q
w
P
a
C
y
j
N
G
H
\194
-
D
i
7
q
p
U
l
n
0
A
z
W
F
x
6
.
^
f
'
E
Z
B
H
p
J
c
u
y
-
x
I
T
4
m
j
v
3
f
2
K
Z
b
E
o
w
\163
i
r
s
k
5
O
(
"
n
'
7
z
R
G
1
t
M
C
m
I
W
R
K
k
f
g
L
r
X
p
e
P
B
n
\194
d
x
1
N
U
.
i
v
s
"
z
o
A
6
a
S
c
/
u
b
\195
C
v
n
E
Z
B
/
x
.
e
'
w
Q
T
H
p
y
W
$
J
L
i
M
b
r
a
R
0
A
z
m
o
D
2
q
\163
k
Non-zero for 27.6% of words.
^Quen tin$____ (2.7) ^FRANCISC O$____ (2.5) ^column $____ (2.4) ^column ist$____ (2.4) ^column s$____ (2.4) ^Vien na$____ (2.3) ^influen ce$____ (2.2) ^influen tial$____ (2.2) ^influen ced$____ (2.2) ^influen za$____ (2.2) ^affluen t$____ (2.2) ^influen ces$____ (2.2) ^Hen ry$____(2.2) ^Hen derson$____(2.2) ^Hen in$____(2.2) ^Hen rik$____(2.2) ^Jen nifer$____(2.2) ^Jen kins$____(2.2) ^Jen ny$____(2.2) ^Jen nings$____(2.2) ^HMR C$____(2.1) ^Gwen $____ (2.1) ^Puer to$____ (2.0) ^D-N. Y.$____ (2.0) ^Hon g$____(2.0) ^Hon da$____(2.0) ^Hon duras$____(2.0) ^Hon duran$____(2.0) ^Jon es$____(2.0) ^Jon athan$____(2.0) ^Jon $____(2.0) ^Jon g$____(2.0) ^Jon as$____(2.0) ^Jon ny$____(2.0) ^asylum$ ____ (1.9) ^curriculum$ ____ (1.9) ^slum$ ____ (1.9) ^Guer nsey$____ (1.9) ^IMF $____(1.9) ^recipien ts$____ (1.8) Filter 62 (bias = -0.50) #
p
.
P
h
i
F
X
N
<BOS>
c
O
u
I
x
K
A
V
s
m
n
z
L
J
g
T
y
Y
o
W
a
G
f
^
S
l
r
Q
e
E
v
w
x
f
b
y
Y
%
\194
O
h
U
V
K
0
m
"
n
9
t
7
,
)
I
k
o
8
-
q
/
Q
i
c
s
X
N
6
M
g
A
Q
h
R
f
I
o
r
x
P
S
U
i
z
M
\195
y
"
m
b
t
X
e
Z
5
d
j
$
F
C
K
'
w
a
4
9
L
k
,
7
B
Z
p
M
z
H
O
.
c
X
t
-
T
m
o
V
S
w
P
/
k
W
l
n
C
Q
d
4
s
N
G
u
a
$
0
6
i
"
,
L
D
Non-zero for 19.8% of words.
^Sullivan $____ (2.4) ^XVI$ ____ (2.3) ^TIM E$____(2.3) ^six-m onth$____ (2.2) ^elephan t$____ (2.1) ^Stephan ie$____ (2.1) ^elephan ts$____ (2.1) ^Ivan $____ (2.1) ^Ivan ovic$____ (2.1) ^umbre lla$____ (2.0) ^bru tal$____(2.0) ^bru sh$____(2.0) ^bru ised$____(2.0) ^bru ising$____(2.0) ^Taliban $____ (1.9) ^Miliban d$____ (1.9) ^Toshiba$ ____ (1.9) ^Caribbe an$____ (1.9) ^tribun al$____ (1.8) ^Tribun e$____ (1.8) ^van $____(1.8) ^van ished$____(1.8) ^Pennsylvan ia$____ (1.8) ^Obam a$____ (1.8) ^Obam as$____ (1.8) ^Malibu$ ____ (1.8) ^bre ak$____(1.8) ^bre aking$____(1.8) ^bre ast$____(1.8) ^bre aks$____(1.8) ^bre akfast$____(1.8) ^bre ach$____(1.8) ^bre athing$____(1.8) ^bre akthrough$____(1.8) ^Silva$ ____ (1.8) ^Rihan na$____ (1.8) ^Thre e$____ (1.7) ^vs. $____(1.7) ^record-bre aking$____ (1.7) ^JPM organ$____(1.6) Filter 63 (bias = -0.43) #
N
g
K
-
L
p
/
k
Q
v
X
j
B
i
"
b
8
d
,
s
W
x
O
w
<BOS>
'
\194
n
D
V
M
c
U
m
6
G
^
z
9
C
B
-
5
!
W
g
6
r
x
U
N
m
4
d
9
u
K
z
S
.
3
?
8
G
2
s
f
y
v
P
0
'
F
\195
\194
R
X
p
7
%
g
f
Y
p
j
y
T
,
l
a
.
x
M
P
V
K
\194
2
G
i
D
W
R
n
b
3
A
F
u
1
J
d
0
8
-
U
$
s
m
w
Q
h
X
x
$
f
O
n
\194
B
I
y
S
u
G
a
Y
q
"
v
7
b
w
c
k
.
m
N
o
A
r
Non-zero for 13.1% of words.
^NBA$ ____ (3.8) ^NBC$ ____ (3.3) ^MSNBC$ ____ (3.3) ^CNBC$ ____ (3.3) ^BT$ ____(3.2) ^TOKYO $____ (3.1) ^NY$ ____(3.1) ^No.$ ____ (3.1) ^NYS E$____(3.0) ^B.$ ____(3.0) ^TORONTO $____ (2.9) ^WTO $____(2.8) ^BST$ ____ (2.8) ^NOT$ ____ (2.8) ^Big$ ____ (2.7) ^BOSTO N$____ (2.6) ^Net$ ____ (2.6) ^NFL$ ____ (2.6) ^W.$ ____(2.6) ^Kell y$____ (2.6) ^Kell er$____ (2.6) ^Dog$ ____ (2.6) ^NATO $____ (2.5) ^OFT$ ____ (2.5) ^Nels on$____ (2.5) ^BA$ ____(2.4) ^Lt.$ ____ (2.4) ^Not$ ____ (2.4) ^HOUSTO N$____ (2.4) ^N.$ ____(2.4) ^Nov$ ____ (2.4) ^850$ ____ (2.3) ^LONDO N$____ (2.3) ^5.7 $____(2.3) ^NFC$ ____ (2.3) ^NYC $____(2.3) ^BBC$ ____ (2.3) ^BAG HDAD$____(2.3) ^Legi slature$____ (2.3) ^JERUSALEM$ ____ (2.3) Filter 64 (bias = -0.51) #
-
x
I
v
Z
c
X
f
g
B
H
o
V
y
j
h
n
T
<BOS>
k
l
t
J
N
7
U
i
S
1
K
2
"
/
F
^
u
P
a
6
s
X
u
F
w
V
E
f
-
P
a
\194
U
l
s
m
g
C
?
Q
i
M
o
K
I
p
H
j
A
'
h
6
2
/
z
7
1
;
W
S
q
C
f
G
e
c
h
R
H
Q
l
z
m
V
t
Z
L
'
u
g
i
k
q
9
y
8
a
r
-
0
E
"
W
$
T
7
B
Y
d
F
K
g
x
j
B
d
f
D
N
I
o
H
W
t
"
i
b
T
/
1
Q
C
\194
e
a
E
L
-
9
p
Y
0
,
2
'
c
.
G
8
\163
Non-zero for 19.3% of words.
^McB ride$____(2.7) ^'Co nnor$____(2.5) ^ICC$ ____ (2.4) ^McN amee$____(2.4) ^FC$ ____(2.4) ^PRN ewswire$____(2.4) ^PRN ewswire-FirstCall$____(2.4) ^PRN ewswire-USNewswire$____(2.4) ^Alco hol$____ (2.2) ^AFC$ ____ (2.2) ^Sco tland$____(2.1) ^Sco ttish$____(2.1) ^Sco tt$____(2.1) ^Sco t$____(2.1) ^Inco me$____ (2.1) ^PC$ ____(2.1) ^McQ ueen$____(2.1) ^FRANCISCO $____ (2.0) ^Fro m$____(2.0) ^Fro nt$____(2.0) ^Fro ntier$____(2.0) ^Fro st$____(2.0) ^IPCC$ ____ (2.0) ^QC$ ____(2.0) ^McL aren$____(1.9) ^McL ean$____(1.9) ^welco me$____ (1.9) ^welco med$____ (1.9) ^welco ming$____ (1.9) ^Welco me$____ (1.9) ^Ofco m$____ (1.9) ^confro ntation$____ (1.9) ^confro nted$____ (1.9) ^confro nt$____ (1.9) ^confro nting$____ (1.9) ^high-pro file$____ (1.9) ^non-pro fit$____ (1.9) ^Pc$ ____(1.8) ^inco me$____ (1.7) ^Linco ln$____ (1.7) Filter 65 (bias = -0.35) #
o
F
K
t
G
d
R
m
O
f
z
j
<EOS>
v
N
k
J
p
Y
e
3
'
U
q
9
.
\195
H
L
-
E
h
0
P
w
V
5
\194
B
n
E
m
-
y
I
k
Q
h
:
M
2
F
&
f
z
c
w
T
W
C
(
H
O
B
!
t
9
n
7
D
s
)
3
p
5
P
J
A
"
r
Q
u
X
v
/
w
\194
k
"
g
l
-
Y
i
O
J
$
U
S
E
W
s
n
j
c
e
M
T
m
0
z
x
-
b
t
V
d
h
D
S
I
F
u
B
T
8
U
Y
z
9
O
4
o
7
w
"
P
Q
y
6
n
X
m
k
i
5
_
W
/
3
l
Non-zero for 11.0% of words.
^go-ah ead$____ (2.2) ^two-th irds$____ (2.1) ^Orthodox $____ (1.9) ^disposab le$____ (1.9) ^homosex uality$____ (1.8) ^homosex ual$____ (1.8) ^Exx on$____(1.8) ^growth $____ (1.7) ^Growth $____ (1.7) ^atmosph ere$____ (1.6) ^goalk eeper$____ (1.6) ^broadb and$____ (1.6) ^Bowl$ ____ (1.5) ^bowl$ ____ (1.5) ^DENV ER$____ (1.5) ^knowle dge$____ (1.5) ^acknowle dged$____ (1.5) ^acknowle dge$____ (1.5) ^acknowle dges$____ (1.5) ^Gilb ert$____ (1.5) ^footb all$____ (1.4) ^Footb all$____ (1.4) ^footb aller$____ (1.4) ^oath $____ (1.4) ^goodb ye$____ (1.3) ^ANGELE S$____ (1.3) ^downh ill$____ (1.3) ^PARIS$ ____ (1.3) ^NEW$ ____ (1.3) ^alb um$____(1.3) ^alb eit$____(1.3) ^alb ums$____(1.3) ^Mosle y$____ (1.2) ^Bloomb erg$____ (1.2) ^Wax man$____(1.2) ^two-ye ar$____ (1.2) ^al-Qa ida$____ (1.2) ^al-Qa eda$____ (1.2) ^Al-Qa eda$____ (1.2) ^goal$ ____ (1.2) Filter 66 (bias = -0.45) #
9
E
x
t
n
O
8
w
7
s
q
g
C
A
b
S
Q
i
V
z
B
U
v
T
k
m
P
G
X
o
R
l
6
e
F
y
'
L
\194
u
H
x
Z
.
1
c
i
S
J
f
X
F
2
h
U
g
3
s
M
b
I
r
P
v
W
'
6
\$
;
p
K
t
/
o
Y
j
4
e
\195
\163
b
g
B
d
Q
o
k
1
"
c
X
n
V
D
f
-
F
G
W
z
M
C
\194
0
m
s
Y
w
'
i
q
u
S
2
$
p
T
5
y
z
E
\194
F
v
s
l
h
X
e
m
S
/
r
K
H
o
4
T
u
D
f
c
3
Y
j
0
g
.
y
$
i
n
k
'
U
C
2
\163
Non-zero for 19.6% of words.
^flexibl e$____ (3.4) ^Bibl e$____ (2.9) ^unifo rm$____ (2.5) ^unifo rms$____ (2.5) ^terribl e$____ (2.4) ^horribl e$____ (2.4) ^terribl y$____ (2.4) ^unanimo usly$____ (2.3) ^unanimo us$____ (2.3) ^Publ ic$____ (2.2) ^Daniel $____ (2.2) ^Daniel s$____ (2.2) ^incredibl e$____ (2.2) ^incredibl y$____ (2.2) ^credibl e$____ (2.2) ^flexibi lity$____ (2.2) ^Vikt or$____ (1.9) ^Xbo x$____(1.9) ^publ ic$____ (1.9) ^Republ ican$____ (1.9) ^publ ished$____ (1.9) ^Republ icans$____ (1.9) ^Republ ic$____ (1.9) ^publ icly$____ (1.9) ^1bn $____(1.9) ^rifl e$____ (1.9) ^rifl es$____ (1.9) ^anybo dy$____ (1.8) ^unabl e$____ (1.8) ^reasonabl e$____ (1.8) ^enabl e$____ (1.8) ^sustainabl e$____ (1.8) ^enabl es$____ (1.8) ^enabl ing$____ (1.8) ^HBO S$____(1.8) ^HBO $____(1.8) ^monito ring$____ (1.8) ^monito r$____ (1.8) ^monito rs$____ (1.8) ^monito red$____ (1.8) Filter 67 (bias = -0.68) #
I
m
R
x
<BOS>
f
D
y
Q
h
J
p
7
b
9
.
P
w
0
a
j
o
E
i
2
W
T
L
z
B
d
K
1
c
C
n
\195
F
\194
M
H
c
h
G
W
z
q
O
:
r
i
K
6
o
4
R
u
C
X
D
\194
s
n
p
!
P
m
E
(
S
-
U
V
0
a
g
Y
\163
7
%
F
g
U
p
f
h
s
-
B
G
Q
b
,
r
S
q
M
x
t
i
/
c
\194
n
K
w
N
o
"
1
$
Y
E
y
6
d
9
0
I
a
w
C
Z
s
M
d
X
t
W
x
m
S
K
R
H
c
J
l
V
F
2
'
E
h
3
,
e
f
i
r
4
o
G
D
b
.
6
\194
Y
A
Non-zero for 17.1% of words.
^Rasm ussen$____ (3.0) ^Risi ng$____ (2.8) ^Insi de$____ (2.8) ^Insi ght$____ (2.8) ^Ipsw ich$____ (2.7) ^Inte rnational$____ (2.7) ^Inte rnet$____ (2.7) ^Inte rior$____ (2.7) ^Inte lligence$____ (2.7) ^Hew itt$____(2.6) ^IMF$ ____ (2.3) ^Rutg ers$____ (2.3) ^Dise ase$____ (2.2) ^PRNew swire$____ (2.2) ^PRNew swire-FirstCall$____ (2.2) ^PRNew swire-USNewswire$____ (2.2) ^Hum an$____(2.2) ^Haw aii$____(2.1) ^Haw ks$____(2.1) ^Haw kins$____(2.1) ^Rese arch$____ (2.1) ^Rese rve$____ (2.1) ^Rese archers$____ (2.1) ^dism issed$____ (2.1) ^dism iss$____ (2.1) ^dism issal$____ (2.1) ^dism al$____ (2.1) ^Resi dents$____ (2.1) ^HIV $____(2.0) ^How ever$____(2.0) ^How $____(2.0) ^How ard$____(2.0) ^RAF$ ____ (2.0) ^Insu rance$____ (1.8) ^Risk $____ (1.8) ^terrorism $____ (1.8) ^tourism $____ (1.8) ^counterterrorism $____ (1.8) ^Tourism $____ (1.8) ^IBM$ ____ (1.8) Filter 68 (bias = -0.56) #
G
u
X
B
p
U
<BOS>
k
O
h
Q
v
g
a
7
f
j
t
-
H
^
T
5
A
/
s
0
b
l
F
P
x
m
q
i
N
T
b
t
x
i
!
o
f
u
.
M
F
O
Q
H
:
Y
a
D
L
1
p
W
\$
c
P
h
?
U
X
k
8
0
V
R
Z
v
r
C
d
X
S
d
r
v
j
Z
f
W
R
c
F
6
t
1
s
z
k
2
E
8
h
0
N
p
A
D
u
q
l
\194
o
a
O
G
e
7
M
y
B
"
p
Q
l
W
v
$
b
S
T
4
P
3
k
\194
z
8
m
/
B
s
g
Y
J
r
\195
c
q
t
D
d
n
Non-zero for 33.2% of words.
^God$ ____ (3.6) ^rapid$ ____ (3.4) ^stupid$ ____ (3.4) ^Olympic$ ____ (3.1) ^topic$ ____ (3.1) ^epic$ ____ (3.1) ^god$ ____ (3.0) ^TD$ ____(3.0) ^empty$ ____ (2.9) ^OTC$ ____ (2.9) ^Capt.$ ____ (2.9) ^strategic$ ____ (2.8) ^tragic$ ____ (2.8) ^magic$ ____ (2.8) ^Magic$ ____ (2.8) ^bankruptcy $____ (2.7) ^T.$ ____(2.7) ^pop$ ____ (2.7) ^Ethiopia$ ____ (2.7) ^Olympics $____ (2.6) ^topics $____ (2.6) ^Sgt.$ ____ (2.6) ^Guy$ ____ (2.6) ^solid$ ____ (2.6) ^slid$ ____ (2.6) ^valid$ ____ (2.6) ^Khalid$ ____ (2.6) ^Tas k$____(2.5) ^Md$ ____(2.5) ^iPod$ ____ (2.5) ^TV$ ____(2.4) ^McQ ueen$____(2.4) ^Did$ ____ (2.4) ^did$ ____ (2.4) ^Georgia$ ____ (2.4) ^Gov. $____ (2.3) ^Tys on$____(2.3) ^spin$ ____ (2.3) ^pin$ ____ (2.3) ^Delta$ ____ (2.3) Filter 69 (bias = -0.42) #
C
w
Q
-
D
o
8
b
7
m
F
v
d
f
"
B
6
i
\194
q
1
J
P
a
c
l
G
.
4
t
9
x
5
e
^
k
Z
\195
X
W
n
U
.
u
I
)
N
y
w
G
t
T
(
P
q
m
l
Y
\$
k
/
h
A
b
5
R
:
%
,
M
Q
g
\194
E
X
;
!
i
&
H
b
f
Q
F
X
j
z
h
V
n
Y
o
W
t
"
S
Z
,
G
s
U
D
k
5
w
l
\195
x
T
C
a
i
m
y
$
d
\163
N
q
e
g
f
G
B
Y
v
Z
t
H
l
X
a
V
o
1
z
4
x
7
N
y
s
3
u
Q
U
$
.
8
b
W
F
"
\195
C
w
2
k
h
_
Non-zero for 14.6% of words.
^CITY $____ (4.1) ^CIT$ ____ (3.5) ^DIEG O$____ (3.5) ^Cabi net$____ (3.3) ^NASDAQ$ ____ (3.2) ^DAX$ ____ (3.2) ^Clay $____ (3.1) ^Clay ton$____ (3.1) ^CNBC $____ (3.1) ^Cleg g$____ (2.8) ^CHICAGO $____ (2.8) ^Clai re$____ (2.7) ^CIA$ ____ (2.7) ^wag es$____(2.6) ^wag e$____(2.6) ^wag ed$____(2.6) ^tag $____(2.6) ^Feb$ ____ (2.6) ^CCTV $____ (2.5) ^Stag e$____ (2.5) ^Cabr era$____ (2.5) ^NASCAR$ ____ (2.5) ^Fabi o$____ (2.4) ^Dwig ht$____ (2.4) ^7.2$ ____ (2.4) ^MOSCOW$ ____ (2.4) ^IV$ ____(2.4) ^Abh isit$____(2.3) ^D.C.$_ ___ (2.3) ^C.$_ ___ (2.3) ^N.C.$_ ___ (2.3) ^S.C.$_ ___ (2.3) ^Broadway $____ (2.3) ^midway $____ (2.3) ^ITV $____(2.3) ^NY$ ____(2.3) ^NW$ ____(2.3) ^midnig ht$____ (2.3) ^lag $____(2.3) ^CNN$ ____ (2.2) Filter 70 (bias = -0.43) #
B
-
Z
d
V
o
k
p
X
j
Q
l
W
f
<BOS>
x
H
s
U
S
A
e
K
g
w
O
Y
y
9
h
C
D
M
E
/
G
b
r
n
t
9
p
Z
t
u
g
8
y
N
O
6
T
(
c
:
i
\194
m
R
P
Q
S
7
G
J
r
"
k
B
l
4
d
?
A
X
C
b
D
&
\163
m
N
G
x
T
n
z
r
U
q
M
9
Y
F
y
R
V
h
Z
f
K
o
X
I
i
a
g
.
P
7
O
8
p
B
D
e
W
j
$
3
f
g
M
T
F
A
'
E
V
a
K
h
/
q
n
d
Z
z
,
D
Q
\163
C
t
$
v
9
0
\194
c
s
G
6
p
S
.
P
r
m
u
Non-zero for 12.4% of words.
^album$ ____ (2.9) ^annum$ ____ (2.9) ^aluminum$ ____ (2.9) ^albums $____ (2.9) ^Dumf ries$____ (2.7) ^Buy$ ____ (2.7) ^3m$ ____(2.4) ^knif e$____ (2.4) ^BNP$ ____ (2.4) ^Zuma $____ (2.3) ^Buff alo$____ (2.3) ^Buff ett$____ (2.3) ^upf ront$____(2.3) ^minimum$ ____ (2.3) ^maximum$ ____ (2.3) ^mum$ ____ (2.3) ^AZUZ $____ (2.2) ^2m$ ____(2.2) ^Ham$ ____ (2.2) ^Cumb ria$____ (2.1) ^Mumb ai$____ (2.1) ^Kuzn etsova$____ (2.1) ^Stadium$ ____ (2.1) ^stadium$ ____ (2.1) ^premium$ ____ (2.1) ^uranium$ ____ (2.1) ^numb er$____ (2.0) ^numb ers$____ (2.0) ^Stamf ord$____ (2.0) ^premiums $____ (2.0) ^stadiums $____ (2.0) ^amn esty$____(2.0) ^umb rella$____(2.0) ^autumn $____ (2.0) ^nume rous$____ (2.0) ^monume nt$____ (2.0) ^Numb er$____ (2.0) ^Quin n$____ (1.9) ^vacuum$ ____ (1.9) ^harmf ul$____ (1.9) Filter 71 (bias = -0.33) #
Z
v
U
x
H
c
/
p
L
f
3
q
A
.
Y
d
V
e
1
h
4
t
I
b
i
o
2
\163
K
-
^
T
X
k
R
r
C
g
M
'
?
z
y
l
h
p
.
P
H
J
"
i
!
I
W
%
(
s
Q
o
\$
R
M
C
Z
t
N
k
q
j
X
v
8
0
:
-
F
\195
e
d
H
z
h
m
r
v
N
U
7
c
R
w
3
d
Y
s
4
T
Q
t
9
D
8
p
1
G
q
.
"
K
S
a
n
l
F
E
j
f
$
-
y
l
"
v
Q
j
W
J
8
z
$
-
O
k
X
b
/
I
M
t
N
w
Y
s
3
B
K
i
Z
g
H
\195
u
E
_
P
Non-zero for 14.4% of words.
^Chry sler$____ (3.5) ^McChry stal$____ (3.5) ^Why $____(3.5) ^Very $____ (3.0) ^fiery $____ (2.9) ^rhy thm$____(2.6) ^3.7$ ____ (2.6) ^Her$ ____ (2.5) ^refinery $____ (2.5) ^machinery $____ (2.5) ^misery $____ (2.5) ^nursery $____ (2.5) ^3.3$ ____ (2.5) ^WHO $____(2.4) ^1.7$ ____ (2.4) ^3.4$ ____ (2.4) ^4.7$ ____ (2.4) ^why $____(2.4) ^Montgomery $____ (2.3) ^U.N. $____ (2.3) ^2.7$ ____ (2.3) ^3.9$ ____ (2.3) ^1.3$ ____ (2.3) ^subsidiary $____ (2.3) ^judiciary $____ (2.3) ^Judiciary $____ (2.3) ^diary $____ (2.3) ^4.3$ ____ (2.3) ^3.8$ ____ (2.3) ^1.4$ ____ (2.2) ^4.4$ ____ (2.2) ^cry $____(2.2) ^cry ing$____(2.2) ^cry stal$____(2.2) ^2.3$ ____ (2.2) ^Mary land$____ (2.2) ^Mary $____ (2.2) ^5.7$ ____ (2.2) ^1.9$ ____ (2.1) ^Keny a$____ (2.1) Filter 72 (bias = -0.47) #
T
N
v
F
z
f
m
x
D
h
U
S
k
n
c
A
u
r
d
L
\194
3
G
5
t
4
-
,
Y
7
M
8
0
a
<BOS>
H
P
e
'
.
?
j
N
p
Z
S
!
f
a
t
(
v
Q
F
H
d
A
s
q
e
:
g
&
l
W
c
U
m
/
T
w
i
\195
x
.
D
X
P
9
k
Y
t
J
p
M
C
K
F
9
g
o
d
"
I
b
c
X
A
W
k
Z
n
\194
r
B
j
L
y
u
i
6
P
-
\163
N
e
R
D
v
,
K
j
W
u
O
k
"
-
S
l
y
v
o
I
8
t
G
g
Q
d
3
r
$
T
5
n
/
b
,
q
Y
H
x
J
N
P
\194
\195
X
D
Non-zero for 13.7% of words.
^Troy $____ (2.5) ^Two$ ____ (2.4) ^NYS E$____(2.3) ^Tax$ ____ (2.1) ^NY$ ____(1.9) ^THE$ ____ (1.9) ^rival$ ____ (1.8) ^approval$ ____ (1.8) ^festival$ ____ (1.8) ^arrival$ ____ (1.8) ^Festival$ ____ (1.8) ^survival$ ____ (1.8) ^removal$ ____ (1.8) ^naval$ ____ (1.8) ^Davy denko$____ (1.8) ^TORO NTO$____ (1.7) ^TIME $____ (1.7) ^Too$ ____ (1.7) ^cab$ ____ (1.7) ^T-Mo bile$____ (1.6) ^Trus t$____ (1.6) ^Tymo shenko$____ (1.6) ^TOKY O$____ (1.6) ^Damo n$____ (1.6) ^DAX$ ____ (1.6) ^Amazo n$____ (1.6) ^Amazo n.com$____ (1.6) ^Jintao$ ____ (1.5) ^UAW$ ____ (1.5) ^Tamp a$____ (1.5) ^stab$ ____ (1.5) ^Tomo rrow$____ (1.5) ^rivals $____ (1.5) ^festivals $____ (1.5) ^approvals $____ (1.5) ^arrivals $____ (1.5) ^Zoo $____(1.5) ^Yao$ ____ (1.5) ^Mao$ ____ (1.5) ^HBO S$____(1.5) Filter 73 (bias = -0.53) #
<BOS>
k
l
v
.
B
Q
U
j
f
X
T
g
y
/
u
S
i
7
p
-
c
L
P
^
m
\194
K
A
a
Y
x
b
W
t
w
k
l
)
x
C
L
w
f
Z
.
R
h
I
%
g
S
V
\$
i
o
U
F
1
:
n
y
u
e
2
N
z
d
T
a
c
m
G
,
H
b
Z
x
y
l
H
v
M
S
Q
s
C
o
U
-
X
f
1
t
r
j
D
z
8
E
/
w
"
i
P
J
$
e
c
p
Y
b
u
a
n
B
t
P
M
U
\194
J
Q
u
W
\195
.
a
"
p
$
i
'
1
X
L
S
G
N
z
/
d
F
R
f
g
e
l
r
b
h
s
Non-zero for 22.9% of words.
^Sky$ ____ (3.1) ^quirky$ ____ (2.7) ^Anyt hing$____ (2.5) ^conflict $____ (2.3) ^conflict s$____ (2.3) ^conflict ing$____ (2.3) ^inflict ed$____ (2.3) ^guy$ ____ (2.3) ^Clint on$____ (2.2) ^Flint off$____ (2.2) ^Clint ons$____ (2.2) ^Clint $____ (2.2) ^full-ye ar$____ (2.2) ^Clayt on$____ (2.1) ^jewelry$ ____ (2.1) ^rivalry$ ____ (2.1) ^sky$ ____ (2.1) ^risky$ ____ (2.1) ^reluct ant$____ (2.1) ^reluct ance$____ (2.1) ^fluct uations$____ (2.1) ^guilty$ ____ (2.1) ^penalty$ ____ (2.1) ^difficulty$ ____ (2.1) ^loyalty$ ____ (2.1) ^energy$ ____ (2.0) ^Energy$ ____ (2.0) ^clergy$ ____ (2.0) ^volunt eers$____ (2.0) ^volunt ary$____ (2.0) ^volunt eer$____ (2.0) ^volunt arily$____ (2.0) ^blunt $____ (2.0) ^volunt eered$____ (2.0) ^AZUZ$ ____ (2.0) ^Any$ ____ (2.0) ^My$ ____(1.9) ^ICC$ ____ (1.9) ^LCD$ ____ (1.8) ^MyS pace$____(1.8) Filter 74 (bias = -0.46) #
6
k
W
t
X
p
2
r
8
m
9
s
3
g
4
T
7
f
Z
c
5
z
N
U
1
P
\194
b
<BOS>
A
Q
l
/
C
^
S
"
F
H
'
u
x
U
p
-
F
T
f
&
h
t
L
R
5
M
8
w
7
z
l
s
6
'
\$
(
P
Y
a
?
b
m
q
"
e
k
y
E
n
\194
0
Q
g
B
y
X
h
9
-
b
i
"
o
\194
u
K
m
V
H
8
n
F
d
6
c
N
1
7
t
W
p
v
D
I
G
P
j
E
l
k
A
x
U
G
u
Y
H
g
I
S
t
c
d
b
P
o
m
0
n
p
L
h
/
v
D
V
,
"
A
9
a
8
y
\194
F
O
w
7
s
'
f
Non-zero for 15.6% of words.
^UBS $____(2.6) ^Hubb le$____ (2.4) ^RBS $____(2.2) ^dubb ed$____ (1.8) ^Tax $____(1.8) ^Queb ec$____ (1.7) ^better-than-ex pected$____ (1.7) ^tax $____(1.7) ^tax es$____(1.7) ^tax payers$____(1.7) ^tax payer$____(1.7) ^tax i$____(1.7) ^tax ation$____(1.7) ^Max $____(1.7) ^Tex as$____(1.6) ^tex t$____(1.6) ^tex ts$____(1.6) ^tex ting$____(1.6) ^6-7$ ____ (1.6) ^Mex ico$____(1.6) ^Mex ican$____(1.6) ^Mex icans$____(1.6) ^pre-tax $____ (1.5) ^urg ed$____(1.5) ^urg ing$____(1.5) ^urg ent$____(1.5) ^urg e$____(1.5) ^CBS $____(1.5) ^USS $____(1.5) ^TB$ ____(1.5) ^sex $____(1.4) ^sex ual$____(1.4) ^sex ually$____(1.4) ^sex y$____(1.4) ^3-6$ ____ (1.4) ^contex t$____ (1.4) ^6-2$ ____ (1.4) ^4-6$ ____ (1.4) ^7-6$ ____ (1.4) ^Exx on$____(1.4) Filter 75 (bias = -0.49) #
<BOS>
p
S
y
Q
-
5
m
E
d
\194
u
4
P
9
i
N
r
F
g
7
n
X
v
V
h
8
T
6
f
B
q
s
o
^
k
A
c
/
a
m
q
M
d
f
1
'
a
K
h
s
H
V
7
w
I
S
2
z
8
k
\163
Z
W
c
4
%
E
U
6
C
:
j
3
o
0
G
p
/
D
R
v
C
q
Y
x
G
w
U
W
V
a
s
e
Z
p
'
f
Q
B
S
2
/
N
$
o
r
t
M
h
g
E
m
d
j
\163
P
i
L
n
Z
P
.
f
g
p
h
k
H
z
Y
U
4
B
$
K
X
T
M
v
n
t
V
l
W
d
7
J
Q
r
\194
I
/
s
A
\195
5
E
8
O
Non-zero for 17.1% of words.
^Ms. $____(3.7) ^S.C. $____ (3.7) ^criticisms$ ____ (3.4) ^mechanisms$ ____ (3.4) ^MGM $____(3.3) ^Mr. $____(3.3) ^HSBC$ ____ (3.2) ^smug gling$____ (3.1) ^smug gled$____ (3.1) ^Ms$ ____(3.1) ^SUV$ ____ (2.9) ^Afgh anistan$____ (2.9) ^Afgh an$____ (2.9) ^Afgh ans$____ (2.9) ^MS$ ____(2.7) ^MRS A$____(2.7) ^Mug abe$____(2.7) ^N.C. $____ (2.7) ^Mr$ ____(2.6) ^FEMA$ ____ (2.6) ^Emmy $____ (2.6) ^Muh ammad$____(2.6) ^fug itive$____(2.6) ^AZUZ $____ (2.6) ^problems$ ____ (2.5) ^seems$ ____ (2.5) ^systems$ ____ (2.5) ^items$ ____ (2.5) ^MP$ ____(2.5) ^Nottinghamsh ire$____ (2.5) ^films$ ____ (2.5) ^Films$ ____ (2.5) ^N.Y. $____ (2.4) ^D-N.Y. $____ (2.4) ^FCC$ ____ (2.4) ^mun icipal$____(2.3) ^'s$ ____(2.3) ^MSN BC$____(2.3) ^NFC$ ____ (2.3) ^mig ht$____(2.3) Filter 76 (bias = -0.45) #
Q
h
C
E
X
i
Z
o
<BOS>
u
\194
-
c
x
/
e
V
s
'
a
^
f
D
S
"
g
M
w
P
r
J
H
j
l
A
G
t
K
q
%
h
z
v
P
n
O
.
U
u
Z
H
m
N
V
B
s
k
E
x
L
A
X
T
;
\$
Y
a
!
F
J
(
Q
c
&
I
c
-
v
H
k
u
b
o
z
J
V
3
B
i
Q
j
p
1
x
h
C
L
'
N
T
l
m
n
X
4
\194
r
"
R
G
I
\163
O
K
2
d
f
-
B
\194
K
Q
x
$
r
.
N
I
k
D
o
"
b
z
h
X
F
p
n
y
J
\195
P
A
M
e
Non-zero for 13.4% of words.
^GB$ ____(3.6) ^Gad dafi$____(3.4) ^Pc$ ____(3.2) ^G.$ ____(3.0) ^Ga$ ____(3.0) ^Gaz a$____(2.6) ^Gaz prom$____(2.6) ^GP$ ____(2.5) ^G8$ ____(2.5) ^PGA$ ____ (2.4) ^LPGA$ ____ (2.4) ^Gas $____(2.4) ^Gas ol$____(2.4) ^Oct ober$____(2.4) ^Oct $____(2.4) ^McQ ueen$____(2.4) ^MOSCOW$ ____ (2.4) ^Pad res$____(2.3) ^Obs erver$____(2.2) ^Gat es$____(2.2) ^Gat e$____(2.2) ^Gat wick$____(2.2) ^PC$ ____(2.2) ^GM$ ____(2.2) ^GE$ ____(2.2) ^K.$ ____(2.2) ^scu lpture$____(2.1) ^scu lptures$____(2.1) ^Ecu ador$____(2.1) ^Okl ahoma$____(2.1) ^Up$ ____(2.1) ^UC$ ____(2.1) ^Gal lery$____(2.1) ^Gal axy$____(2.1) ^Gal lup$____(2.1) ^Gal loway$____(2.1) ^Gav in$____(2.1) ^MPC$ ____ (2.0) ^MGM$ ____ (2.0) ^PKK$ ____ (2.0) Filter 77 (bias = -0.58) #
<BOS>
p
Q
u
X
i
\194
y
V
d
C
f
Z
-
^
x
.
a
M
P
t
h
/
o
A
U
c
s
S
1
m
J
\195
3
L
f
a
j
A
F
h
C
b
P
.
'
g
n
Y
I
L
R
q
-
?
M
W
,
w
s
G
t
z
S
E
;
!
k
T
\194
y
7
x
9
B
S
u
F
-
p
w
8
v
Q
U
y
z
X
n
G
.
"
\195
7
J
O
R
4
a
5
m
K
l
e
o
3
I
6
s
x
t
c
i
h
B
Y
D
V
I
m
N
k
e
h
q
s
t
b
d
'
w
S
o
g
2
G
O
i
E
U
K
x
p
u
\163
R
c
C
n
H
0
Z
T
"
a
Non-zero for 25.3% of words.
^fem ale$____(2.7) ^fem ales$____(2.7) ^fes tival$____(2.5) ^fes tivals$____(2.5) ^fes tive$____(2.5) ^fes tivities$____(2.5) ^Ips wich$____(2.4) ^Fes tival$____(2.3) ^Feb ruary$____(2.3) ^Feb $____(2.3) ^sym ptoms$____(2.2) ^sym bol$____(2.2) ^sym pathy$____(2.2) ^sym bolic$____(2.2) ^Pes hawar$____(2.2) ^Sym phony$____(2.1) ^'S$ ____(2.1) ^FRANCISC O$____ (2.1) ^sys tem$____(2.1) ^sys tems$____(2.1) ^sys temic$____(2.1) ^sys tematic$____(2.1) ^Afgh anistan$____ (2.0) ^Afgh an$____ (2.0) ^Afgh ans$____ (2.0) ^feu d$____(2.0) ^Sys tems$____(2.0) ^Sys tem$____(2.0) ^FSA $____(2.0) ^Rem ember$____(2.0) ^IS$ ____(2.0) ^CCTV $____ (1.9) ^Mem bers$____(1.9) ^Mem orial$____(1.9) ^Mem phis$____(1.9) ^Mem ber$____(1.9) ^nes t$____(1.9) ^MyS pace$____(1.9) ^witnes ses$____ (1.9) ^witnes s$____ (1.9) Filter 78 (bias = -0.47) #
x
u
S
U
F
T
\194
-
Q
w
5
\195
7
J
<BOS>
H
V
i
j
r
h
P
.
R
8
m
6
o
X
z
l
y
4
M
C
k
'
E
c
I
f
g
M
p
F
h
Z
T
B
G
N
i
K
A
9
1
J
c
'
O
m
t
n
y
/
d
V
a
b
\163
\194
H
L
E
-
Y
6
0
:
D
V
-
k
f
B
d
v
.
Y
l
T
s
Z
r
0
L
M
o
c
y
X
a
W
O
C
e
G
,
9
u
4
N
b
I
g
F
\194
p
6
t
H
j
y
f
1
k
W
v
L
c
Y
s
X
g
6
S
/
F
h
b
u
x
a
'
2
z
"
t
U
-
Q
w
3
r
d
e
8
R
4
J
Non-zero for 15.8% of words.
^SUV$ ____ (3.2) ^Fuku da$____ (3.0) ^HSBC$ ____ (3.0) ^Such $____ (2.7) ^execu tive$____ (2.5) ^execu tives$____ (2.5) ^Execu tive$____ (2.5) ^execu tion$____ (2.5) ^MSNBC $____ (2.4) ^Secu rity$____ (2.4) ^Secu rities$____ (2.4) ^sky $____(2.4) ^Suga r$____ (2.3) ^Nku nda$____(2.3) ^halfwa y$____ (2.3) ^McL aren$____(2.2) ^McL ean$____(2.2) ^BMW $____(2.2) ^MVP $____(2.1) ^Seba stian$____ (2.1) ^Suzu ki$____ (2.1) ^levy $____ (2.1) ^Sovi et$____ (2.0) ^ex-wi fe$____ (2.0) ^McQ ueen$____(2.0) ^Demjanjuk$ ____ (2.0) ^FT$ ____(2.0) ^guardian.co.uk$ ____ (2.0) ^Afgh anistan$____ (1.9) ^Afgh an$____ (1.9) ^Afgh ans$____ (1.9) ^GMT$ ____ (1.9) ^affid avit$____ (1.9) ^5-0$ ____ (1.8) ^McD onald$____(1.8) ^McD onnell$____(1.8) ^BT$ ____(1.8) ^agency $____ (1.8) ^emergency $____ (1.8) ^Agency $____ (1.8) Filter 79 (bias = -0.46) #
c
r
D
E
C
i
\194
u
Z
h
X
k
/
a
G
-
Q
w
m
f
M
b
d
H
L
J
^
q
z
B
.
e
6
R
0
o
'
\195
8
N
r
v
R
t
N
W
J
\194
o
m
\195
T
Z
k
3
d
9
p
n
i
K
x
L
a
M
'
G
q
j
:
8
"
%
V
P
S
?
;
u
z
Y
f
V
y
X
d
W
r
b
n
\194
D
4
U
S
u
h
c
6
o
g
P
Q
-
7
C
$
N
"
R
x
s
5
,
p
t
I
H
p
Q
k
/
v
7
c
X
z
L
x
j
w
\194
b
$
f
6
B
D
a
4
o
Y
K
l
G
M
U
s
m
g
i
r
Non-zero for 25.7% of words.
^Jacob$ ____ (3.2) ^NY$ ____(3.0) ^massacre$ ____ (2.8) ^Dog$ ____ (2.7) ^NW$ ____(2.6) ^DNA$ ____ (2.6) ^NYS E$____(2.5) ^protocol$ ____ (2.5) ^Protocol$ ____ (2.5) ^Dr.$ ____ (2.5) ^Cox$ ____ (2.4) ^MOSCOW$ ____ (2.4) ^crie d$____ (2.4) ^crie s$____ (2.4) ^Dubl in$____ (2.4) ^coll ege$____ (2.3) ^coll eagues$____ (2.3) ^coll apse$____ (2.3) ^coll ection$____ (2.3) ^Col$ ____ (2.3) ^telecom$ ____ (2.3) ^Ofcom$ ____ (2.3) ^Telecom$ ____ (2.3) ^Viacom$ ____ (2.3) ^DAX$ ____ (2.3) ^mob$ ____ (2.2) ^crit ical$____ (2.2) ^crit icism$____ (2.2) ^crit ics$____ (2.2) ^crit icized$____ (2.2) ^crit icised$____ (2.2) ^crit ic$____ (2.2) ^crit eria$____ (2.2) ^crit ically$____ (2.2) ^cred it$____ (2.2) ^cred its$____ (2.2) ^cred ibility$____ (2.2) ^cred ited$____ (2.2) ^obj ect$____(2.2) ^obj ects$____(2.2) Filter 80 (bias = -0.55) #
X
w
Q
o
F
-
"
u
P
n
M
a
y
A
8
q
6
R
Z
r
G
t
V
v
m
h
K
.
D
\195
^
I
\194
N
7
g
/
i
l
w
x
W
h
Q
u
?
P
X
d
.
R
t
U
(
L
!
p
I
D
M
v
E
%
e
f
O
9
:
o
J
0
l
8
C
Q
-
B
u
K
g
S
d
X
c
W
n
N
h
F
y
E
1
A
D
,
C
/
p
f
i
\194
v
"
r
5
R
L
o
H
j
G
-
B
g
f
j
k
G
,
J
a
d
U
E
A
e
t
Z
K
X
x
0
N
D
F
2
y
w
h
M
C
7
R
.
S
6
o
1
s
4
r
Non-zero for 9.9% of words.
^wag es$____(3.3) ^wag e$____(3.3) ^wag ed$____(3.3) ^sewag e$____ (2.9) ^Wag ner$____(2.6) ^Volkswag en$____ (2.4) ^tag $____(2.3) ^25- year-old$____(2.3) ^ex- wife$____(2.2) ^26- year-old$____(2.2) ^Stag e$____ (2.2) ^Meg an$____(2.1) ^Meg rahi$____(2.1) ^Mag ic$____(2.1) ^Mag azine$____(2.1) ^Mag istrates$____(2.1) ^Mag na$____(2.1) ^Eag les$____(2.1) ^Eag le$____(2.1) ^eag er$____(2.1) ^eag le$____(2.1) ^eag erly$____(2.1) ^wed ding$____(2.0) ^wed dings$____(2.0) ^Norweg ian$____ (2.0) ^Dwig ht$____ (2.0) ^wee k$____(2.0) ^wee ks$____(2.0) ^wee kend$____(2.0) ^wee kly$____(2.0) ^wee kends$____(2.0) ^22- year-old$____(2.0) ^Afg hanistan$____(2.0) ^Afg han$____(2.0) ^Afg hans$____(2.0) ^45- year-old$____(2.0) ^24- year-old$____(2.0) ^24- hour$____(2.0) ^28- year-old$____(1.9) ^Al- Qaeda$____(1.9) Filter 81 (bias = -0.51) #
R
m
s
w
C
.
9
p
F
y
S
q
8
l
u
A
7
o
U
r
4
K
\194
a
j
e
6
g
k
W
5
X
3
b
V
O
0
t
"
L
X
u
F
o
7
T
Z
y
6
s
5
U
n
h
Q
O
V
t
8
-
L
i
P
Y
9
k
!
E
2
g
j
R
I
v
l
m
\$
c
/
z
B
g
Q
T
9
-
N
p
8
d
Z
y
V
t
5
D
F
m
7
c
x
i
/
O
6
j
n
e
,
G
\194
u
K
o
A
\163
4
v
L
E
y
v
m
g
K
0
P
j
Q
E
M
w
f
x
/
h
O
J
"
s
X
z
r
5
,
A
'
u
$
4
p
-
U
i
L
n
Z
c
I
Non-zero for 21.1% of words.
^sexy $____ (3.2) ^funny $____ (3.1) ^sunny $____ (3.1) ^slay ing$____ (3.0) ^1979$ ____ (2.8) ^1969$ ____ (2.8) ^Clay $____ (2.7) ^Clay ton$____ (2.7) ^Islam ic$____ (2.7) ^Islam $____ (2.7) ^Islam ist$____ (2.7) ^Islam abad$____ (2.7) ^slam med$____ (2.7) ^slam $____ (2.7) ^1959$ ____ (2.6) ^1978$ ____ (2.6) ^tsunam i$____ (2.6) ^Fay ed$____(2.5) ^1968$ ____ (2.5) ^1975$ ____ (2.5) ^1977$ ____ (2.5) ^1965$ ____ (2.4) ^1989$ ____ (2.4) ^1967$ ____ (2.4) ^1976$ ____ (2.4) ^1958$ ____ (2.4) ^snap ped$____ (2.4) ^snap $____ (2.4) ^snap ping$____ (2.4) ^Tuesday $____ (2.4) ^Wednesday $____ (2.4) ^Thursday $____ (2.4) ^1966$ ____ (2.3) ^1999$ ____ (2.3) ^1955$ ____ (2.3) ^1957$ ____ (2.3) ^1974$ ____ (2.3) ^nicknam e$____ (2.3) ^nicknam ed$____ (2.3) ^Disney $____ (2.2) Filter 82 (bias = -0.40) #
Z
f
X
l
W
j
G
-
Q
F
c
x
z
h
K
r
"
s
T
p
2
u
Y
i
^
S
w
e
\194
n
8
t
0
d
U
P
/
k
b
;
A
-
y
v
c
\194
L
"
a
Y
n
V
.
J
C
X
D
k
g
Q
h
j
U
R
F
W
s
M
o
'
G
T
,
b
p
:
\$
9
d
Q
p
R
w
Y
f
V
y
C
v
7
e
\194
o
Z
i
$
x
X
E
/
t
c
m
-
d
T
a
W
\163
O
w
'
E
d
J
\194
B
C
i
.
2
c
K
D
e
/
a
y
I
"
3
$
W
Q
b
m
A
u
\195
-
k
F
4
h
z
v
O
n
_
Non-zero for 20.3% of words.
^D-Ca lif$____ (3.5) ^SOURCE $____ (3.5) ^;$_ ___(3.1) ^-$_ ___(3.1) ^XVI $____(3.0) ^SEATTLE $____ (3.0) ^tackli ng$____ (3.0) ^THE $____(3.0) ^ICE $____(2.9) ^MRI $____(2.8) ^low-ke y$____ (2.8) ^v$_ ___(2.7) ^tackle $____ (2.6) ^reckle ss$____ (2.6) ^tackle d$____ (2.6) ^tackle s$____ (2.6) ^backla sh$____ (2.6) ^ANGELE S$____ (2.6) ^back$_ ___ (2.6) ^attack$_ ___ (2.6) ^black$_ ___ (2.6) ^Barack$_ ___ (2.6) ^stock$_ ___ (2.6) ^lack$_ ___ (2.6) ^CITY$_ ___ (2.5) ^Zuri ch$____ (2.5) ^NYSE $____ (2.5) ^GM$_ ___ (2.5) ^MGM$_ ___ (2.5) ^TV$_ ___ (2.5) ^ITV$_ ___ (2.5) ^MTV$_ ___ (2.5) ^CCTV$_ ___ (2.5) ^"$_ ___(2.4) ^Y$_ ___(2.4) ^IAE A$____(2.4) ^Vla dimir$____(2.4) ^vow ed$____(2.4) ^vow ing$____(2.4) ^V$_ ___(2.4) Filter 83 (bias = -0.45) #
c
J
.
P
D
i
"
\195
\194
p
Q
B
M
k
g
f
d
r
S
R
F
l
y
b
t
a
\163
n
^
I
e
K
8
o
X
U
h
-
T
w
G
f
z
k
L
F
&
u
l
H
g
i
Y
t
D
h
0
B
c
q
X
,
O
n
o
e
K
I
!
\$
Z
a
.
s
/
;
5
y
Q
x
-
y
X
U
I
c
J
f
j
A
7
F
6
h
V
s
2
B
\194
K
i
L
l
o
q
r
Q
N
W
m
$
u
4
t
5
S
1
.
d
D
B
-
K
g
N
p
U
d
Q
.
,
v
R
x
/
w
S
n
Y
m
9
c
O
b
L
j
M
h
"
e
8
q
3
'
X
\163
\194
i
T
u
Non-zero for 14.7% of words.
^D-N .Y.$____(3.0) ^10-K $____ (2.7) ^co-o peration$____ (2.7) ^co-f ounder$____ (2.5) ^FOXN ews.com$____ (2.5) ^GPS $____(2.4) ^co-a uthor$____ (2.2) ^Al-Q aeda$____ (2.1) ^protocol$ ____ (2.0) ^Protocol$ ____ (2.0) ^GP$ ____(2.0) ^LeB ron$____(2.0) ^D-C alif$____(1.9) ^G20 $____(1.9) ^0-2 $____(1.9) ^Glo bal$____(1.9) ^Glo be$____(1.9) ^Glo ucester$____(1.9) ^Glo ucestershire$____(1.9) ^Fritzl$ ____ (1.9) ^cop$ ____ (1.8) ^Moscow$ ____ (1.8) ^cow$ ____ (1.8) ^5-3 $____(1.8) ^article$ ____ (1.8) ^vehicle$ ____ (1.8) ^cycle$ ____ (1.8) ^circle$ ____ (1.8) ^FOX$ ____ (1.8) ^G8$ ____(1.7) ^icon$ ____ (1.7) ^Silicon$ ____ (1.7) ^bacon$ ____ (1.7) ^DAX$ ____ (1.7) ^Gif fords$____(1.7) ^0-1 $____(1.7) ^0-0 $____(1.7) ^clif f$____ (1.7) ^Radclif fe$____ (1.7) ^G$_ ___(1.6) Filter 84 (bias = -0.30) #
<BOS>
x
M
p
U
h
m
a
u
q
Z
g
\194
w
/
o
Y
c
D
e
X
r
V
f
^
E
Q
A
R
N
P
S
'
\163
H
b
T
y
L
W
E
v
?
C
W
D
Q
P
(
n
:
f
g
m
.
)
!
c
h
d
A
J
\$
0
w
M
r
T
a
B
S
z
"
U
q
k
H
l
O
p
Z
h
X
a
M
y
V
A
6
o
j
t
J
r
5
U
9
u
7
p
2
s
4
O
w
T
Q
k
n
l
8
d
\194
S
e
,
-
i
$
z
O
-
K
j
W
d
Y
.
y
v
U
n
G
F
"
l
T
x
3
e
Q
u
o
f
B
D
r
g
,
s
X
'
i
\194
N
t
E
q
k
b
Non-zero for 25.7% of words.
^AZU Z$____(4.0) ^Many $____ (3.4) ^JERUSALEM$ ____ (3.4) ^DAX$ ____ (3.2) ^Majo r$____ (3.2) ^Majo rity$____ (3.2) ^many $____ (3.2) ^Germany $____ (3.2) ^four-y ear$____ (3.0) ^majo r$____ (3.0) ^majo rity$____ (3.0) ^majo rs$____ (3.0) ^EMI $____(2.9) ^WHO $____(2.9) ^Any $____(2.8) ^Any one$____(2.8) ^Any thing$____(2.8) ^Any way$____(2.8) ^AM$ ____(2.8) ^DIEGO $____ (2.8) ^UEFA $____ (2.8) ^rugby $____ (2.7) ^Rugby $____ (2.7) ^TIME$_ ___ (2.7) ^any $____(2.6) ^any thing$____(2.6) ^any one$____(2.6) ^any where$____(2.6) ^Sny der$____(2.6) ^ACO RN$____(2.6) ^Egy pt$____(2.6) ^Egy ptian$____(2.6) ^ISLAMA BAD$____ (2.5) ^CHICAGO $____ (2.5) ^MGM$ ____ (2.5) ^Mani la$____ (2.5) ^Lawy ers$____ (2.5) ^They $____ (2.5) ^turno ut$____ (2.5) ^turno ver$____ (2.5) Filter 85 (bias = -0.40) #
<BOS>
t
V
y
-
T
b
A
'
c
Q
o
Z
D
X
a
Y
O
J
,
7
h
j
B
\194
N
R
U
9
p
"
i
^
K
6
w
G
H
<EOS>
q
M
d
B
z
f
.
H
l
K
!
)
a
k
&
Z
s
F
I
V
Q
m
g
J
-
n
:
4
G
i
p
3
O
N
A
W
\163
9
D
6
r
Q
y
l
c
j
p
X
w
7
k
\194
i
J
g
L
U
6
a
9
m
R
t
/
h
N
T
$
s
F
z
Y
W
8
v
V
C
5
G
o
M
d
V
p
C
a
Z
l
k
-
'
e
R
h
Q
D
$
y
B
q
S
L
/
P
n
x
Y
1
K
E
\194
i
c
\163
u
T
v
Non-zero for 16.8% of words.
^MLS $____(3.1) ^MDC $____(3.0) ^MRS A$____(2.9) ^MPC $____(2.8) ^Volk swagen$____ (2.8) ^rebel$ ____ (2.6) ^label$ ____ (2.6) ^Nobel$ ____ (2.6) ^libel$ ____ (2.6) ^MS$ ____(2.6) ^bulk $____ (2.6) ^M.$ ____(2.5) ^MP$ ____(2.4) ^Me$ ____(2.4) ^MSN BC$____(2.4) ^Kabul$ ____ (2.3) ^Istanbul$ ____ (2.3) ^rebels $____ (2.3) ^labels $____ (2.3) ^Mr$ ____(2.3) ^Men $____(2.2) ^Men tal$____(2.2) ^HMRC $____ (2.2) ^MRI $____(2.2) ^silk $____ (2.2) ^1947$ ____ (2.1) ^MI5 $____(2.1) ^Blo omberg$____(2.1) ^Blo od$____(2.1) ^Milk $____ (2.0) ^MPs $____(2.0) ^M$_ ___(2.0) ^milk $____ (2.0) ^symbol$ ____ (2.0) ^Mes si$____(2.0) ^47$ ____(2.0) ^G.M.$ ____ (2.0) ^RBI$ ____ (2.0) ^MIA MI$____(2.0) ^RBS$ ____ (2.0) Filter 86 (bias = -0.62) #
P
h
p
A
C
.
G
H
z
q
U
W
R
t
c
N
K
w
d
a
f
S
k
e
J
E
'
l
9
Y
s
4
D
L
Z
g
\195
0
k
J
h
L
'
%
t
D
Y
P
C
E
S
K
Q
\195
?
-
g
d
V
e
W
0
A
2
H
9
"
z
\194
o
(
6
.
f
!
3
l
\195
S
q
s
J
h
I
f
X
x
Z
F
B
g
a
j
z
y
U
t
P
'
9
c
w
i
2
C
b
p
N
o
Q
,
T
m
u
O
R
G
Q
n
X
g
"
-
W
t
Y
j
$
s
\194
f
8
c
/
i
6
k
Z
w
p
v
o
C
u
z
_
l
x
Non-zero for 10.9% of words.
^PHIL ADELPHIA$____ (2.8) ^vodka$ ____ (2.8) ^Alaska$ ____ (2.6) ^Nebraska$ ____ (2.6) ^Lanka$ ____ (2.3) ^Sasha$ ____ (2.3) ^Costa$ ____ (2.2) ^Vista$ ____ (2.2) ^Augusta$ ____ (2.2) ^pasta$ ____ (2.2) ^Ethiopia$ ____ (2.1) ^DAX$ ____ (2.1) ^NASCAR$ ____ (2.0) ^TEHRAN$ ____ (2.0) ^Jakarta$ ____ (2.0) ^Alberta$ ____ (2.0) ^Ch\195\161 vez$____ (2.0) ^Atlanta$ ____ (2.0) ^Santa$ ____ (2.0) ^junta$ ____ (2.0) ^Schwartz$ ____ (2.0) ^Capt.$ ____ (1.9) ^acceptab le$____ (1.9) ^unacceptab le$____ (1.9) ^Mogadishu$ ____ (1.9) ^Phar maceuticals$____ (1.9) ^Abkhazia$ ____ (1.9) ^Huckab ee$____ (1.8) ^Aziz$ ____ (1.8) ^Rebecca$ ____ (1.8) ^CCTV $____ (1.8) ^Doha$ ____ (1.7) ^capita$ ____ (1.7) ^Anita$ ____ (1.7) ^CBI$ ____ (1.7) ^Garcia$ ____ (1.7) ^Valencia$ ____ (1.7) ^Patricia$ ____ (1.7) ^Anelka$ ____ (1.7) ^SOURCE$ ____ (1.7) Filter 87 (bias = -0.44) #
P
g
,
h
Q
E
p
c
I
v
/
j
<BOS>
o
f
u
K
0
U
.
X
-
C
e
m
G
'
w
^
J
k
D
a
x
T
M
N
?
i
R
p
.
l
c
,
Q
I
"
%
r
w
b
t
u
H
'
f
&
2
N
1
v
W
Z
P
9
5
)
a
(
n
8
m
\194
4
Y
O
Q
v
Z
m
X
g
7
T
I
h
8
o
N
t
/
x
9
i
n
l
C
p
$
z
2
s
R
c
"
b
3
k
6
E
f
S
G
g
f
Y
,
h
p
Z
P
V
t
M
d
b
I
j
O
H
a
G
U
J
K
4
s
.
z
X
y
0
o
u
F
-
/
7
D
m
S
w
\163
Non-zero for 16.5% of words.
^Prag ue$____ (2.9) ^Punj ab$____ (2.7) ^PRNe wswire$____ (2.7) ^PRNe wswire-FirstCall$____ (2.7) ^PRNe wswire-USNewswire$____ (2.7) ^plung ed$____ (2.5) ^lung $____ (2.5) ^plung e$____ (2.5) ^plung ing$____ (2.5) ^Hung ary$____ (2.5) ^Hung arian$____ (2.5) ^Peng uins$____ (2.4) ^U.N. $____ (2.4) ^PR$_ ___ (2.4) ^Aung $____ (2.3) ^prag matic$____ (2.3) ^P.$_ ___ (2.3) ^Pc$_ ___ (2.3) ^urg ed$____(2.2) ^urg ing$____(2.2) ^urg ent$____(2.2) ^urg e$____(2.2) ^urg ency$____(2.2) ^urg ently$____(2.2) ^Samsung $____ (2.2) ^sung lasses$____ (2.2) ^cag e$____(2.2) ^frag ile$____ (2.1) ^frag ments$____ (2.1) ^Chicag o$____ (2.1) ^McQu een$____ (2.1) ^unh appy$____(2.1) ^unh ealthy$____(2.1) ^Kong $____ (2.0) ^preg nant$____ (1.9) ^preg nancy$____ (1.9) ^Reg ional$____(1.9) ^Reg iment$____(1.9) ^Reg ion$____(1.9) ^Reg ardless$____(1.9) Filter 88 (bias = -0.59) #
m
x
Y
n
H
c
T
o
u
f
M
N
U
5
V
S
b
,
X
w
Z
t
P
s
J
F
"
a
\195
p
k
C
G
.
L
I
<BOS>
2
^
A
!
m
a
M
Q
f
q
j
(
o
?
%
:
T
W
G
I
p
.
K
2
S
7
O
\194
P
&
J
8
l
9
r
X
y
6
g
"
D
1
c
i
v
Y
.
G
c
U
q
K
x
O
d
3
F
L
t
J
N
m
n
V
f
H
e
z
r
P
-
Z
\163
4
'
1
h
S
D
l
b
s
j
Q
m
R
h
"
y
\194
g
'
n
z
M
U
i
I
w
9
e
$
H
k
L
r
j
P
f
Y
o
b
t
s
A
8
5
1
c
D
Non-zero for 16.7% of words.
^Dubai$ ____ (2.8) ^Mumbai$ ____ (2.8) ^maid en$____ (2.7) ^normal$ ____ (2.6) ^animal$ ____ (2.6) ^formal$ ____ (2.6) ^minimal$ ____ (2.6) ^Christmas$ ____ (2.6) ^Thomas$ ____ (2.6) ^Hamas$ ____ (2.6) ^Tomas$ ____ (2.6) ^dramas$ ____ (2.6) ^Bahamas$ ____ (2.6) ^Obamas$ ____ (2.6) ^mask $____ (2.5) ^mask s$____ (2.5) ^mask ed$____ (2.5) ^Ham$ ____ (2.4) ^columnis t$____ (2.4) ^Pais ley$____ (2.4) ^air $____(2.3) ^air port$____(2.3) ^air craft$____(2.3) ^air line$____(2.3) ^animals $____ (2.3) ^mammals $____ (2.3) ^mass ive$____ (2.3) ^mass $____ (2.3) ^mass acre$____ (2.3) ^mass es$____ (2.3) ^mass age$____ (2.3) ^amass ed$____ (2.3) ^Hamb urg$____ (2.2) ^may$ ____ (2.2) ^dismay$ ____ (2.2) ^Raik konen$____ (2.2) ^e-mail $____ (2.2) ^mail $____ (2.2) ^email $____ (2.2) ^e-mail s$____ (2.2) Filter 89 (bias = -0.45) #
I
x
t
b
E
h
<BOS>
f
O
c
D
k
Q
m
2
n
d
v
T
B
z
y
U
V
/
o
^
'
H
p
,
g
1
M
X
.
r
F
\194
r
X
y
D
p
j
w
l
u
6
k
L
i
F
-
Q
a
5
?
7
o
S
U
0
f
&
n
V
g
/
\195
M
s
(
R
8
h
P
V
u
C
E
p
h
Z
-
P
o
z
e
k
N
n
t
K
H
X
.
G
d
m
y
b
T
'
D
B
O
c
"
Q
q
/
S
5
j
l
s
y
v
O
q
G
j
"
J
S
l
K
B
Q
n
s
I
$
t
'
b
/
T
U
e
Y
u
W
0
,
-
M
k
8
a
m
A
\195
w
Non-zero for 16.5% of words.
^FRANCISCO $____ (4.2) ^MOSCO W$____ (3.5) ^ITV$ ____ (3.2) ^ICC$ ____ (3.0) ^DC$ ____(2.8) ^Sky $____(2.7) ^Sky pe$____(2.7) ^Sny der$____(2.6) ^FC$ ____(2.5) ^QC$ ____(2.5) ^LPG A$____(2.5) ^AFC$ ____ (2.4) ^PLC$ ____ (2.4) ^USC$ ____ (2.4) ^IOC$ ____ (2.4) ^DIEGO $____ (2.4) ^Ill$ ____ (2.4) ^OTC$ ____ (2.3) ^deny $____ (2.3) ^deny ing$____ (2.3) ^helps $____ (2.2) ^Phelps $____ (2.2) ^steps $____ (2.2) ^footsteps $____ (2.2) ^ACO RN$____(2.2) ^http$ ____ (2.2) ^help$ ____ (2.2) ^Help$ ____ (2.2) ^step$ ____ (2.2) ^FIFA$ ____ (2.2) ^CDC$ ____ (2.1) ^FCC $____(2.1) ^Academy $____ (2.1) ^academy $____ (2.1) ^LLC$ ____ (2.1) ^IPCC $____ (2.0) ^BCS $____(2.0) ^TVs $____(2.0) ^Diy ala$____(2.0) ^TV$ ____(1.9) Filter 90 (bias = -0.56) #
O
B
-
h
<BOS>
F
Q
x
/
v
y
k
"
0
'
j
w
b
.
J
W
l
^
A
M
4
m
9
o
a
6
i
7
5
L
A
v
I
-
C
f
Q
x
H
m
&
b
Z
o
7
p
1
u
2
'
5
.
(
e
4
k
/
d
3
h
X
q
U
y
,
c
L
M
8
w
'
w
C
E
"
J
y
2
k
e
S
I
c
q
F
i
\194
-
R
a
f
0
Q
1
Y
z
h
\195
s
g
x
5
m
3
$
W
/
A
,
X
d
K
C
w
u
E
n
W
D
o
P
N
I
O
H
M
1
B
F
b
s
e
k
X
U
G
'
J
t
S
R
"
7
Y
i
f
j
l
Non-zero for 14.4% of words.
^Asd a$____(4.2) ^AFC $____(4.2) ^ICC $____(4.1) ^NASC AR$____ (3.8) ^Aud i$____(3.8) ^Aud it$____(3.8) ^Add $____(3.5) ^Add itionally$____(3.5) ^Add itional$____(3.5) ^Add ing$____(3.5) ^And $____(3.5) ^And rew$____(3.5) ^And y$____(3.5) ^And erson$____(3.5) ^Abd ullah$____(3.5) ^Abd ul$____(3.5) ^Abd ulmutallab$____(3.5) ^Hyd e$____(3.5) ^AFP $____(3.3) ^Amn esty$____(3.3) ^WASH INGTON$____ (3.3) ^ABC $____(3.1) ^ANC $____(3.1) ^NASD AQ$____ (3.1) ^AC$ ____(3.0) ^Act $____(3.0) ^Act ion$____(3.0) ^Act ually$____(3.0) ^Act or$____(3.0) ^GMAC$ ____ (3.0) ^Arn old$____(3.0) ^Aft er$____(3.0) ^Ass ociation$____(3.0) ^Ass ociated$____(3.0) ^Ass embly$____(3.0) ^Ass istant$____(3.0) ^Ask ed$____(2.9) ^Ask $____(2.9) ^Aun g$____(2.9) ^Aid $____(2.9) Filter 91 (bias = -0.40) #
Y
p
\194
r
h
P
H
f
u
z
6
w
"
O
4
I
7
t
8
c
W
E
M
K
X
e
9
k
V
g
^
\163
Z
a
/
G
L
s
Q
A
N
m
Q
g
B
i
9
p
q
G
F
y
"
l
\$
%
E
d
8
z
R
C
(
-
b
L
r
D
e
Y
x
1
f
H
v
s
W
T
I
c
Y
f
V
y
z
-
X
u
l
e
G
F
\194
.
0
N
b
M
B
r
k
d
Q
n
7
o
C
h
J
t
6
E
A
w
T
c
$
s
R
D
h
p
Y
f
\194
P
A
y
b
O
.
K
Q
o
V
-
u
m
7
n
H
c
4
r
B
w
L
i
x
M
"
,
a
e
q
d
9
G
v
t
Non-zero for 16.8% of words.
^NYS E$____(3.3) ^NY$ ____(3.2) ^NBA $____(2.7) ^Hezb ollah$____ (2.7) ^NCA A$____(2.6) ^NYC $____(2.5) ^Nku nda$____(2.4) ^Sanchez$ ____ (2.2) ^Blu e$____(2.2) ^Blu es$____(2.2) ^Blu -ray$____(2.2) ^haza rdous$____ (2.1) ^haza rd$____ (2.1) ^haza rds$____ (2.1) ^VEGA S$____ (2.1) ^Rodriguez$ ____ (2.1) ^HIV$ ____ (2.1) ^Bla ck$____(2.1) ^Bla ir$____(2.1) ^Bla ke$____(2.1) ^Bla ckburn$____(2.1) ^Bla ckwater$____(2.1) ^Bla ckBerry$____(2.1) ^Bla cks$____(2.1) ^Yell ow$____ (2.0) ^BAA $____(2.0) ^Yeah $____ (1.9) ^Nix on$____(1.9) ^Sheph erd$____ (1.9) ^NAS A$____(1.9) ^NAS CAR$____(1.9) ^NAS DAQ$____(1.9) ^shelv es$____ (1.9) ^Melb ourne$____ (1.9) ^Rachel$ ____ (1.8) ^Michel$ ____ (1.8) ^bushel$ ____ (1.8) ^Murph y$____ (1.8) ^Venezuela $____ (1.8) ^Venezuela n$____ (1.8) Filter 92 (bias = -0.37) #
H
v
L
-
A
b
1
k
2
x
y
'
U
g
/
p
3
f
,
c
D
<BOS>
4
j
K
r
5
.
Z
R
6
m
X
o
N
z
8
V
O
s
"
A
v
L
\194
n
W
a
Y
g
Q
l
M
p
;
U
9
C
-
%
R
z
T
m
'
y
(
P
X
s
N
i
:
d
o
!
q
Z
S
F
Y
p
U
d
Z
j
B
F
W
f
K
e
z
c
/
g
V
h
Q
r
X
P
$
x
w
t
M
y
m
D
L
n
\194
\163
"
-
b
C
A
i
g
N
G
r
V
q
C
u
c
a
S
.
i
\195
p
I
s
L
k
Q
m
l
5
H
4
R
M
D
j
A
x
d
'
B
0
-
Z
U
Y
E
Non-zero for 12.8% of words.
^HOUS TON$____ (3.2) ^HSBC $____ (3.0) ^HMRC $____ (2.8) ^Loug hner$____ (2.6) ^LSU$ ____ (2.3) ^Hewi tt$____ (2.3) ^Loui s$____ (2.3) ^Loui siana$____ (2.3) ^Loui sville$____ (2.3) ^Loui se$____ (2.3) ^JERUS ALEM$____ (2.2) ^Lewi s$____ (2.2) ^Hous e$____ (2.2) ^Hous ton$____ (2.2) ^Hous ing$____ (2.2) ^NYC $____(2.1) ^NYS E$____(2.0) ^SUV $____(2.0) ^Wyomi ng$____ (2.0) ^Leag ue$____ (1.9) ^Doug las$____ (1.9) ^Doug $____ (1.9) ^Hosp ital$____ (1.9) ^Hosp itals$____ (1.9) ^How$ ____ (1.8) ^Hoss ein$____ (1.8) ^Domi nican$____ (1.8) ^Domi nic$____ (1.8) ^UAW$_ ___ (1.8) ^RBS $____(1.8) ^Low$ ____ (1.8) ^Huss ein$____ (1.7) ^Huss ain$____ (1.7) ^Holi day$____ (1.7) ^vag ue$____(1.7) ^WAS HINGTON$____(1.7) ^NASDAQ$_ ___ (1.7) ^Hong $____ (1.7) ^PRNewswire-US Newswire$____ (1.6) ^ravag ed$____ (1.6) Filter 93 (bias = -0.43) #
E
l
W
f
w
n
G
F
"
L
2
h
Q
m
O
j
z
t
\163
A
9
H
3
x
<BOS>
,
Z
C
c
d
X
.
8
D
K
/
U
P
0
i
B
g
o
j
l
e
N
E
/
F
K
-
&
)
T
G
,
s
t
p
\194
V
A
?
z
b
D
\163
L
4
Y
\$
U
S
a
r
n
!
(
d
w
y
X
d
B
h
W
p
b
C
K
s
J
F
Z
i
v
u
V
t
E
g
z
r
Q
f
9
n
M
H
N
D
\194
c
2
,
Y
P
6
1
X
c
5
u
6
k
S
U
4
v
7
T
3
g
Q
y
2
m
W
z
8
d
J
C
K
r
9
t
j
.
e
a
N
-
F
D
V
'
E
R
Non-zero for 12.0% of words.
^ANGELES $____ (3.0) ^DENVE R$____ (2.7) ^Gove rnment$____ (2.7) ^Gove rnor$____ (2.7) ^ECB$ ____ (2.7) ^BEIJI NG$____ (2.3) ^owe d$____(2.3) ^owe $____(2.3) ^owe s$____(2.3) ^Towe r$____ (2.2) ^Moscow$ ____ (2.2) ^cow$ ____ (2.2) ^vowe d$____ (2.2) ^Elvi s$____ (2.2) ^Kobe $____ (2.2) ^wave $____ (2.1) ^wave s$____ (2.1) ^wave d$____ (2.1) ^airwave s$____ (2.1) ^renewabl e$____ (2.1) ^WAM$ ____ (2.0) ^Rowe $____ (2.0) ^Glasgow$ ____ (2.0) ^obj ect$____(2.0) ^obj ects$____(2.0) ^obj ective$____(2.0) ^obj ections$____(2.0) ^obe sity$____(2.0) ^obe se$____(2.0) ^Jacob$ ____ (1.9) ^UBS $____(1.9) ^cove r$____ (1.9) ^recove ry$____ (1.9) ^discove red$____ (1.9) ^cove rage$____ (1.9) ^Twe nty20$____(1.9) ^Twe nty$____(1.9) ^Twe lve$____(1.9) ^Envi ronmental$____ (1.9) ^Envi ronment$____ (1.9) Filter 94 (bias = -0.63) #
.
f
A
J
Q
P
W
j
Z
-
w
p
c
u
/
i
a
v
z
o
X
R
^
k
g
e
C
r
M
F
\195
T
x
l
l
x
H
f
D
k
J
a
j
v
-
p
L
b
/
E
n
W
Y
c
Z
B
7
s
1
\163
R
w
&
y
(
e
M
S
X
F
C
"
I
K
X
c
Q
y
L
u
l
-
7
t
6
k
5
T
/
g
V
f
\194
v
Y
r
A
p
Z
d
9
M
8
m
B
e
4
s
K
o
$
\163
h
r
v
H
x
k
d
E
\194
A
.
i
c
U
6
O
0
T
-
R
D
M
n
P
z
g
5
Y
8
I
o
y
9
t
l
S
7
\195
'
j
L
Non-zero for 17.9% of words.
^Alli ance$____ (3.0) ^Alli ed$____ (3.0) ^Alli son$____ (3.0) ^DALLA S$____ (2.8) ^Alle n$____ (2.6) ^U.S.-le d$____ (2.4) ^All$ ____ (2.2) ^declar ed$____ (2.1) ^declar ation$____ (2.1) ^declar e$____ (2.1) ^declar ing$____ (2.1) ^clar ity$____ (2.1) ^clar ify$____ (2.1) ^calli ng$____ (2.1) ^falli ng$____ (2.1) ^alli es$____ (2.1) ^alli ance$____ (2.1) ^salar y$____ (2.1) ^salar ies$____ (2.1) ^alar m$____ (2.1) ^alar ming$____ (2.1) ^malar ia$____ (2.1) ^alar med$____ (2.1) ^Alla n$____ (2.1) ^ALL$ ____ (2.0) ^lar ge$____(1.9) ^lar gest$____(1.9) ^lar gely$____(1.9) ^lar ger$____(1.9) ^burglar y$____ (1.9) ^Al$_ ___ (1.9) ^Clar k$____ (1.9) ^Clar ke$____ (1.9) ^Clar kson$____ (1.9) ^Clar ence$____ (1.9) ^Har ry$____(1.9) ^Har ris$____(1.9) ^Har vard$____(1.9) ^Har t$____(1.9) ^Dar ling$____(1.8) Filter 95 (bias = -0.41) #
9
m
R
g
Q
v
F
-
8
T
I
.
N
w
7
h
,
y
P
t
3
u
C
M
<BOS>
l
5
c
2
o
4
i
S
b
B
Y
K
d
6
z
Y
F
;
s
o
d
T
f
O
e
W
\$
R
j
r
c
H
x
"
.
m
5
i
E
\195
D
-
C
M
)
X
n
:
2
G
t
&
S
?
L
l
y
X
c
J
x
\194
p
Y
h
j
f
z
r
T
k
I
C
/
n
Q
F
$
d
L
a
V
u
s
U
8
P
'
1
6
r
H
O
1
-
n
t
h
s
7
E
4
z
Z
k
8
f
5
R
L
T
X
S
2
'
q
o
3
U
9
j
x
P
W
m
A
p
0
e
Non-zero for 27.0% of words.
^Rola nd$____ (3.3) ^Roth $____ (3.2) ^Romn ey$____ (3.0) ^Nola n$____ (3.0) ^Fulh am$____ (3.0) ^Noth ing$____ (2.8) ^Pola nd$____ (2.8) ^Pola nski$____ (2.8) ^Roll $____ (2.7) ^Roll ing$____ (2.7) ^Roll ins$____ (2.7) ^Coca-Cola $____ (2.7) ^NY$_ ___ (2.6) ^FOXN ews.com$____ (2.6) ^Foll owing$____ (2.5) ^Roon ey$____ (2.5) ^Cold $____ (2.4) ^Poli ce$____ (2.4) ^Poli sh$____ (2.4) ^Poli cy$____ (2.4) ^Poli tical$____ (2.4) ^Poli tics$____ (2.4) ^Poli ticians$____ (2.4) ^FOX$ ____ (2.4) ^Sola r$____ (2.4) ^Col. $____ (2.4) ^Col$ ____ (2.3) ^Coli n$____ (2.3) ^Ruth $____ (2.3) ^Fren ch$____ (2.2) ^Fren chman$____ (2.2) ^Motorola $____ (2.2) ^Soth eby$____ (2.2) ^Yon hap$____(2.2) ^Roma n$____ (2.2) ^Roma nia$____ (2.2) ^Roma $____ (2.2) ^Roma nian$____ (2.2) ^Poll s$____ (2.2) ^Poll $____ (2.2) Filter 96 (bias = -0.49) #
J
y
j
t
V
c
<BOS>
.
9
o
b
a
R
p
7
O
4
T
6
A
F
m
3
d
E
D
-
,
Z
W
5
U
8
z
Y
/
0
w
X
K
N
k
(
g
o
p
"
i
&
b
/
V
\194
A
Q
P
8
m
O
j
M
)
:
C
D
w
.
l
u
z
\$
I
9
a
K
n
W
G
?
r
A
f
G
v
z
n
L
R
E
x
w
k
U
-
O
u
W
r
X
F
m
9
g
j
Y
q
K
h
Z
'
Q
o
/
N
S
C
l
P
a
B
Q
p
\194
g
.
i
/
T
'
h
$
J
s
m
C
b
,
P
"
G
N
e
I
y
H
k
r
E
j
M
\195
1
Non-zero for 15.2% of words.
^Joan $____ (3.0) ^enjoy$ ____ (3.0) ^joy$ ____ (3.0) ^enjoys $____ (3.0) ^Joe$ ____ (2.9) ^Job$ ____ (2.9) ^Jobs $____ (2.9) ^elbow$ ____ (2.8) ^bow$ ____ (2.8) ^job$ ____ (2.7) ^Jews $____ (2.7) ^jobs $____ (2.7) ^Join t$____ (2.6) ^Juan $____ (2.5) ^Just ice$____ (2.5) ^Just $____ (2.5) ^Just in$____ (2.5) ^join ed$____ (2.5) ^join $____ (2.5) ^join t$____ (2.5) ^join ing$____ (2.5) ^join s$____ (2.5) ^join tly$____ (2.5) ^Joyc e$____ (2.4) ^DVDs$ ____ (2.4) ^MRSA$ ____ (2.4) ^symbol$ ____ (2.4) ^bols ter$____ (2.4) ^symbols $____ (2.4) ^bols tered$____ (2.4) ^boas ts$____ (2.4) ^boas t$____ (2.4) ^boas ted$____ (2.4) ^FDA$ ____ (2.3) ^marijuan a$____ (2.3) ^just $____ (2.3) ^just ice$____ (2.3) ^adjust ed$____ (2.3) ^just ify$____ (2.3) ^boss $____ (2.3) Filter 97 (bias = -0.55) #
X
l
Q
a
M
A
G
o
g
s
"
u
Z
B
e
U
<BOS>
x
V
L
E
n
j
,
\163
z
2
f
8
t
r
h
7
i
^
v
W
.
4
m
!
D
m
t
-
F
W
c
?
C
b
j
:
0
w
A
X
T
V
h
"
S
;
N
Y
n
Z
,
Q
5
'
B
l
d
o
1
h
-
R
m
k
w
Q
J
"
l
Y
i
C
p
c
e
B
f
\194
j
r
P
8
d
9
K
N
L
A
M
q
2
x
O
'
z
b
o
S
E
l
r
L
y
Y
n
6
f
G
c
5
k
J
t
z
'
S
C
X
M
7
u
E
-
V
N
4
p
0
F
b
P
x
w
\194
q
2
m
8
o
Non-zero for 15.3% of words.
^working-cl ass$____ (3.6) ^middle-cl ass$____ (3.5) ^Assembl y$____ (3.2) ^assembl y$____ (3.2) ^Wembl ey$____ (3.2) ^assembl ed$____ (3.2) ^femal e$____ (3.1) ^femal es$____ (3.1) ^Guatemal a$____ (3.1) ^Formul a$____ (2.8) ^formul a$____ (2.8) ^normal $____ (2.7) ^formal $____ (2.7) ^normal ly$____ (2.7) ^formal ly$____ (2.7) ^basebal l$____ (2.7) ^Basebal l$____ (2.7) ^mul tiple$____(2.7) ^mul tinational$____(2.7) ^mul timedia$____(2.7) ^mal e$____(2.6) ^mal l$____(2.6) ^mal es$____(2.6) ^mal aria$____(2.6) ^free-ki ck$____ (2.4) ^renewal $____ (2.4) ^sidewal k$____ (2.4) ^turbul ent$____ (2.4) ^turbul ence$____ (2.4) ^Wal l$____(2.4) ^Wal es$____(2.4) ^Wal ker$____(2.4) ^Wal -Mart$____(2.4) ^Wal ter$____(2.4) ^verbal $____ (2.4) ^world-cl ass$____ (2.3) ^bul k$____(2.3) ^bul let$____(2.3) ^bul lets$____(2.3) ^bul lying$____(2.3) Filter 98 (bias = -0.56) #
i
c
-
.
u
x
H
N
J
Q
j
F
Y
K
I
8
s
B
1
D
g
q
m
C
k
L
E
a
T
X
U
f
R
b
3
A
4
\163
w
"
V
f
X
o
7
y
Q
p
Y
m
\194
%
H
s
6
O
4
-
Z
c
(
w
C
E
9
K
:
r
5
e
8
U
&
z
x
d
a
M
p
Z
i
J
h
Y
y
K
d
R
a
z
F
N
x
X
f
\194
k
/
H
o
t
.
,
Q
I
G
1
L
P
9
g
$
e
\195
s
"
q
x
-
F
r
c
H
S
T
s
\195
8
u
C
I
5
i
9
l
4
J
6
m
V
O
7
o
f
w
v
t
0
q
\194
P
G
Y
h
M
B
R
Non-zero for 11.9% of words.
^casinos $____ (2.8) ^Latinos $____ (2.8) ^dinos aurs$____ (2.8) ^IMF $____(2.6) ^Hoc key$____(2.5) ^Cox $____(2.5) ^philos ophy$____ (2.4) ^philos ophical$____ (2.4) ^mid-199 0s$____ (2.4) ^6-7$_ ___ (2.3) ^Villaraigos a$____ (2.3) ^7.5 $____(2.3) ^Hos pital$____(2.2) ^Hos pitals$____(2.2) ^Hos sein$____(2.2) ^70s $____(2.1) ^minus $____ (2.1) ^inex pensive$____ (2.0) ^inex perienced$____ (2.0) ^7.4 $____(2.0) ^Coc a-Cola$____(2.0) ^1998 $____ (1.9) ^7.6 $____(1.9) ^WASHING TON$____ (1.9) ^HMRC $____ (1.9) ^DVDs $____ (1.9) ^6.8 $____(1.9) ^4.8 $____(1.9) ^films $____ (1.9) ^Films $____ (1.9) ^Huc kabee$____(1.8) ^1970s $____ (1.8) ^Gibbs $____ (1.8) ^1995 $____ (1.8) ^7-5 $____(1.8) ^7-6$_ ___ (1.8) ^4-6$_ ___ (1.8) ^3-6$_ ___ (1.8) ^6-4$_ ___ (1.8) ^5-4$_ ___ (1.8) Filter 99 (bias = -0.58) #
g
R
w
u
p
9
O
v
m
J
A
B
t
k
y
U
.
8
W
x
X
F
G
7
l
s
/
f
e
b
c
P
K
\195
z
0
\163
6
S
3
-
k
J
p
3
c
6
x
2
A
9
b
(
m
7
g
E
h
u
f
1
a
4
y
X
t
I
C
&
F
8
r
:
.
M
B
5
'
\194
n
Z
-
8
t
L
v
H
o
4
z
Q
p
h
T
F
l
y
j
3
I
X
d
7
i
A
k
"
J
6
w
V
'
W
f
N
O
9
c
2
P
h
E
x
z
F
-
y
w
k
l
f
I
C
J
p
u
V
D
n
d
8
t
'
T
B
.
,
O
H
s
"
U
c
\195
4
v
7
e
9
j
Non-zero for 18.9% of words.
^cost-ef fective$____ (2.3) ^Jay $____(2.0) ^Jay s$____(2.0) ^GMAC $____ (1.9) ^Why $____(1.9) ^Jak e$____(1.9) ^Jak arta$____(1.9) ^go-ah ead$____ (1.9) ^clubh ouse$____ (1.8) ^fuel-ef ficient$____ (1.8) ^follow-up $____ (1.8) ^push $____ (1.8) ^push ed$____ (1.8) ^push ing$____ (1.8) ^push es$____ (1.8) ^guy$ ____ (1.8) ^first-ha lf$____ (1.7) ^right-ha nder$____ (1.7) ^left-ha nder$____ (1.7) ^ISLAMAB AD$____ (1.7) ^NFC $____(1.7) ^yuan $____ (1.7) ^weak $____ (1.6) ^weak ness$____ (1.6) ^weak er$____ (1.6) ^weak ened$____ (1.6) ^RAF $____(1.6) ^century $____ (1.6) ^Century $____ (1.6) ^19th-century $____ (1.6) ^half-century $____ (1.6) ^start-up $____ (1.6) ^hip-ho p$____ (1.6) ^speak $____ (1.6) ^speak ing$____ (1.6) ^peak $____ (1.6) ^Speak ing$____ (1.6) ^gunf ire$____ (1.5) ^PRNewswire-Fi rstCall$____ (1.5) ^turk ey$____ (1.5) Filter 100 (bias = -0.44) #
<BOS>
g
W
h
f
c
K
A
w
C
X
D
Q
T
2
j
I
G
"
r
6
l
,
0
-
Y
/
.
3
k
O
u
P
L
^
t
9
H
N
R
Q
u
X
U
j
o
I
y
g
B
e
f
7
m
2
%
E
h
!
k
5
R
\$
L
V
a
(
K
\194
x
4
\195
6
i
S
s
\163
,
G
n
m
R
W
u
X
r
t
-
p
o
K
g
,
J
/
j
Q
s
\194
G
6
\195
B
E
V
9
a
0
f
c
q
N
y
h
D
3
C
v
F
-
f
Y
P
c
,
g
A
w
H
G
L
0
t
W
l
o
y
z
I
"
U
.
p
\194
S
b
s
9
i
q
C
T
r
x
n
u
/
Non-zero for 15.8% of words.
^Wemb ley$____ (2.7) ^Xav ier$____(2.6) ^Twelv e$____ (2.4) ^gav e$____(2.3) ^Welc ome$____ (2.2) ^giv e$____(2.1) ^giv en$____(2.1) ^giv ing$____(2.1) ^giv es$____(2.1) ^fetc h$____ (2.1) ^Elv is$____(1.9) ^emo tional$____(1.8) ^emo tions$____(1.8) ^emo tion$____(1.8) ^emo tionally$____(1.8) ^Sav e$____(1.8) ^Inv estors$____(1.7) ^Inv estment$____(1.7) ^Inv estigators$____(1.7) ^Inv estigation$____(1.7) ^Inv estor$____(1.7) ^Inv estments$____(1.7) ^Inv erness$____(1.7) ^Weng er$____ (1.7) ^flav or$____ (1.7) ^Webb $____ (1.7) ^Webb er$____ (1.7) ^emb assy$____(1.7) ^emb race$____(1.7) ^emb arrassing$____(1.7) ^emb raced$____(1.7) ^emb arrassment$____(1.7) ^emb arrassed$____(1.7) ^emb edded$____(1.7) ^Gav in$____(1.7) ^felo ny$____ (1.6) ^lifelo ng$____ (1.6) ^env ironment$____(1.6) ^env ironmental$____(1.6) ^env oy$____(1.6) Filter 101 (bias = -0.51) #
<BOS>
F
w
f
Q
D
-
h
W
t
X
y
Z
c
V
d
Y
T
G
C
^
u
<EOS>
L
E
B
2
x
"
,
I
p
v
k
m
P
M
z
h
c
H
d
m
I
f
D
i
&
F
C
V
!
4
.
j
a
Y
G
B
p
S
0
k
Q
W
O
e
R
y
r
b
\163
u
\195
x
U
Q
i
X
k
W
u
"
h
\194
-
D
n
E
J
O
m
t
b
.
R
N
g
c
l
$
f
\163
x
2
s
e
p
T
H
/
\195
I
r
8
P
W
.
I
x
w
L
i
h
T
F
X
l
2
s
O
f
K
u
E
d
1
b
3
c
k
n
Q
r
H
m
p
'
$
A
"
D
t
o
4
j
Non-zero for 16.6% of words.
^whet her$____ (2.2) ^hei ght$____(2.1) ^hei ghtened$____(2.1) ^hei ghts$____(2.1) ^hei r$____(2.1) ^Hew itt$____(2.1) ^wick ets$____ (2.1) ^wick et$____ (2.1) ^Gatwick $____ (2.1) ^Warwick shire$____ (2.1) ^Twick enham$____ (2.1) ^Hei ghts$____(2.1) ^Hei neken$____(2.1) ^Hei di$____(2.1) ^WHO$ ____ (2.1) ^wit$ ____ (2.0) ^lightw eight$____ (2.0) ^Twitt er$____ (2.0) ^fighti ng$____ (2.0) ^lighti ng$____ (2.0) ^highlighti ng$____ (2.0) ^Fighti ng$____ (2.0) ^FDI C$____(2.0) ^Alzhei mer$____ (2.0) ^Whet her$____ (1.9) ^View $____ (1.9) ^whee l$____ (1.9) ^whee ls$____ (1.9) ^whee lchair$____ (1.9) ^few $____(1.9) ^few er$____(1.9) ^McQ ueen$____(1.8) ^M.$ ____(1.8) ^FOX $____(1.7) ^FOX News.com$____(1.7) ^e-mai l$____ (1.7) ^e-mai ls$____ (1.7) ^Switz erland$____ (1.7) ^HD$ ____(1.7) ^BEI JING$____(1.7) Filter 102 (bias = -0.47) #
x
t
8
T
7
w
6
o
F
u
9
O
Q
-
V
i
5
m
4
A
X
g
L
r
b
k
\194
M
"
I
3
z
Z
c
P
H
S
U
d
y
h
w
x
E
Y
U
7
O
C
t
V
z
b
K
8
s
\194
I
q
e
!
%
H
f
p
-
?
M
Q
2
'
o
n
J
g
T
l
i
X
W
R
f
Y
p
D
x
j
m
H
a
g
y
Z
b
0
W
C
v
J
P
u
K
7
d
1
,
M
w
G
e
4
'
9
s
T
k
3
q
A
\163
\194
y
I
h
Q
p
D
r
z
f
5
m
6
k
/
b
X
x
7
g
9
i
0
H
C
e
$
o
2
a
Z
F
\163
P
O
u
Non-zero for 15.4% of words.
^1970$ ____ (3.5) ^FCC$ ____ (3.4) ^787$ ____ (3.4) ^1977$ ____ (3.3) ^1971$ ____ (3.3) ^Exxon $____ (3.2) ^1980$ ____ (3.2) ^800$ ____ (3.2) ^1,800$ ____ (3.2) ^1974$ ____ (3.2) ^FARC $____ (3.1) ^1979$ ____ (3.1) ^1973$ ____ (3.0) ^LCD$ ____ (3.0) ^1987$ ____ (3.0) ^1981$ ____ (3.0) ^700$ ____ (3.0) ^1,700$ ____ (3.0) ^1975$ ____ (3.0) ^Mogadishu$ ____ (2.9) ^1984$ ____ (2.8) ^1970s $____ (2.8) ^850$ ____ (2.8) ^1978$ ____ (2.8) ^600$ ____ (2.8) ^1,600$ ____ (2.8) ^6.7$ ____ (2.8) ^1960$ ____ (2.8) ^6.1$ ____ (2.8) ^7.4$ ____ (2.8) ^8.5$ ____ (2.8) ^1989$ ____ (2.8) ^excus e$____ (2.8) ^excus es$____ (2.8) ^hun dreds$____(2.7) ^hun dred$____(2.7) ^hun t$____(2.7) ^hun ting$____(2.7) ^1990$ ____ (2.7) ^1983$ ____ (2.7) Filter 103 (bias = -0.67) #
H
c
4
z
i
.
h
O
J
t
3
p
7
T
6
G
V
D
9
y
j
\163
Y
m
u
K
5
r
1
w
n
d
R
U
8
g
F
A
-
a
W
.
H
c
i
g
B
s
X
-
;
j
6
G
1
r
3
D
2
z
,
d
4
o
K
'
P
?
I
S
q
E
Y
t
9
u
7
F
V
v
Q
-
N
l
"
s
X
i
8
v
Z
z
y
m
W
j
7
g
9
J
H
w
r
u
q
t
$
f
/
p
3
o
2
x
K
k
F
d
b
K
d
m
g
B
D
f
C
w
t
J
r
b
c
L
T
M
\163
o
I
l
h
x
E
n
1
Z
H
V
Q
/
y
X
u
\195
F
z
"
N
j
Non-zero for 17.7% of words.
^Harm an$____ (2.5) ^Hayw ard$____ (2.4) ^Shirl ey$____ (2.3) ^Birm ingham$____ (2.2) ^harm $____ (2.1) ^pharm aceutical$____ (2.1) ^charm $____ (2.1) ^harm ful$____ (2.1) ^Harb or$____ (2.1) ^chief $____ (2.1) ^Chief $____ (2.1) ^chief s$____ (2.1) ^Chief s$____ (2.1) ^Wyo ming$____(2.0) ^firm $____ (1.9) ^confirm ed$____ (1.9) ^firm s$____ (1.9) ^confirm $____ (1.9) ^vehicl es$____ (1.9) ^vehicl e$____ (1.9) ^Haro ld$____ (1.9) ^Harl em$____ (1.9) ^View $____ (1.8) ^WASHING TON$____ (1.8) ^Raym ond$____ (1.8) ^Wem bley$____(1.8) ^VW$_ ___ (1.8) ^hiri ng$____ (1.7) ^unarm ed$____ (1.7) ^harb or$____ (1.7) ^harb our$____ (1.7) ^shiel d$____ (1.7) ^Shiel ds$____ (1.7) ^shipm ents$____ (1.7) ^shipm ent$____ (1.7) ^Whitm an$____ (1.6) ^Hawaii$_ ___ (1.6) ^Wii$_ ___ (1.6) ^Delhi$_ ___ (1.6) ^Shi$_ ___ (1.6) Filter 104 (bias = -0.69) #
7
m
8
-
6
w
9
g
4
o
F
.
Q
t
5
z
X
c
3
v
2
T
1
u
V
y
P
O
C
s
H
r
Z
'
,
M
\194
p
I
G
i
U
W
u
S
D
p
.
O
d
w
R
o
L
g
F
Y
b
M
a
h
)
t
z
4
r
3
\195
5
P
y
Q
m
Z
j
!
,
9
G
8
X
k
L
v
7
a
6
w
5
u
D
x
Q
-
8
t
3
p
j
f
M
b
4
c
G
n
/
s
Z
'
J
m
K
i
S
q
Y
.
O
z
j
y
-
c
J
h
e
C
E
a
l
p
X
k
w
r
M
U
V
x
I
o
6
,
5
A
2
R
Z
"
4
1
7
T
m
8
L
u
\194
O
Non-zero for 14.4% of words.
^Fiel d$____ (3.3) ^Fiel ds$____ (3.3) ^FOX$ ____ (3.3) ^Fiji $____ (3.2) ^Vill a$____ (2.9) ^Vill age$____ (2.9) ^Vill araigosa$____ (2.9) ^Film $____ (2.9) ^Film s$____ (2.9) ^FOXN ews.com$____ (2.8) ^Hill ary$____ (2.8) ^Hill $____ (2.8) ^Hill s$____ (2.8) ^Wi-Fi$_ ___ (2.7) ^View $____ (2.6) ^Fide l$____ (2.6) ^Foll owing$____ (2.6) ^7-6$ ____ (2.6) ^7-5$ ____ (2.5) ^Fire $____ (2.5) ^Fire fighters$____ (2.5) ^Rile y$____ (2.5) ^Bill $____ (2.5) ^Bill y$____ (2.5) ^Bill s$____ (2.5) ^Bill board$____ (2.5) ^ill egal$____(2.4) ^ill $____(2.4) ^ill ness$____(2.4) ^ill egally$____(2.4) ^ill nesses$____(2.4) ^ill icit$____(2.4) ^ill ustrated$____(2.4) ^ill ustrate$____(2.4) ^1947$ ____ (2.4) ^1957$ ____ (2.4) ^Lill y$____ (2.4) ^Xinj iang$____ (2.3) ^1946$ ____ (2.3) ^1945$ ____ (2.3) Filter 105 (bias = -0.38) #
y
J
m
I
.
B
g
5
"
2
'
,
b
0
h
l
Q
z
r
9
Y
s
c
E
p
U
W
i
X
3
\163
K
q
j
-
6
M
D
G
L
K
g
N
h
9
t
B
m
J
T
P
v
R
.
Z
i
8
-
n
d
\195
S
f
k
3
j
o
s
L
u
5
H
/
Y
,
'
r
W
2
\163
Q
h
z
f
X
y
/
g
\194
m
I
p
A
k
a
j
N
i
L
M
$
-
W
u
9
F
7
e
K
c
r
v
n
H
x
Y
r
m
c
\194
p
V
I
M
a
X
P
W
d
6
A
$
C
h
E
l
U
"
\163
H
t
4
z
b
D
N
O
y
_
n
Non-zero for 20.8% of words.
^U.K.$ ____ (3.3) ^McNam ee$____ (3.2) ^D-N.Y .$____ (3.1) ^dynam ic$____ (3.1) ^dynam ics$____ (3.1) ^N.Y .$____(3.0) ^U.N.$ ____ (2.9) ^tournam ent$____ (2.5) ^tournam ents$____ (2.5) ^Kab ul$____(2.5) ^K.$ ____(2.5) ^program $____ (2.4) ^program s$____ (2.4) ^program me$____ (2.4) ^Program $____ (2.4) ^N.J.$ ____ (2.4) ^Jam es$____(2.3) ^Jam ie$____(2.3) ^Jam aica$____(2.3) ^Magna$ ____ (2.3) ^Kai ne$____(2.3) ^Kai ser$____(2.3) ^NL$ ____(2.3) ^Samoa$ ____ (2.3) ^NW$ ____(2.2) ^signal $____ (2.2) ^signal s$____ (2.2) ^signal ed$____ (2.2) ^signal led$____ (2.2) ^Abram ovich$____ (2.2) ^Pam ela$____(2.2) ^KAB UL$____(2.2) ^Stockholm $____ (2.2) ^Ram irez$____(2.1) ^Ram s$____(2.1) ^Ram os$____(2.1) ^Ram sey$____(2.1) ^Nav y$____(2.1) ^Nav al$____(2.1) ^nicknam e$____ (2.1) Filter 106 (bias = -0.56) #
F
-
B
d
S
w
M
p
t
E
C
g
\194
u
V
o
A
a
<BOS>
\195
k
2
/
J
m
1
Q
r
Y
G
X
z
T
I
,
i
L
\163
h
O
A
f
!
v
a
x
?
-
D
'
Q
o
H
j
U
M
G
S
L
s
y
k
1
n
I
J
r
\194
E
i
Z
m
2
B
X
V
g
F
\163
u
W
L
K
g
\194
d
"
j
k
D
'
F
$
E
M
l
,
A
w
h
B
.
X
e
Q
u
O
r
t
s
/
G
i
b
o
J
Y
0
\195
S
-
Q
d
V
c
k
n
Y
o
X
u
B
v
$
D
W
.
F
g
t
1
K
y
"
q
,
x
O
0
h
p
a
w
\195
Non-zero for 28.4% of words.
^UAW$ ____ (2.7) ^ISLAMABA D$____ (2.7) ^Fait h$____ (2.6) ^FA$_ ___ (2.6) ^FIFA$_ ___ (2.6) ^UEFA$_ ___ (2.6) ^MOSCOW$ ____ (2.6) ^Batt alion$____ (2.5) ^Batt le$____ (2.5) ^SAN$ ____ (2.5) ^NBA$_ ___ (2.4) ^BA$_ ___ (2.4) ^VAT$ ____ (2.4) ^Bank $____ (2.3) ^Bank s$____ (2.3) ^Bank ing$____ (2.3) ^Bank ers$____ (2.3) ^GMAC$ ____ (2.3) ^FEMA $____ (2.3) ^Frit zl$____ (2.3) ^Batm an$____ (2.2) ^Sao$ ____ (2.2) ^USA$_ ___ (2.2) ^NASA$_ ___ (2.2) ^FSA$_ ___ (2.2) ^TSA$_ ___ (2.2) ^habitat$ ____ (2.2) ^Ban$ ____ (2.2) ^Fans $____ (2.1) ^AM$ ____(2.1) ^MGM$ ____ (2.1) ^Maki ng$____ (2.1) ^Matt $____ (2.1) ^Matt hew$____ (2.1) ^Matt hews$____ (2.1) ^Bail ey$____ (2.1) ^taki ng$____ (2.1) ^undertaki ng$____ (2.1) ^breathtaki ng$____ (2.1) ^Brit ish$____ (2.1) Filter 107 (bias = -0.46) #
<BOS>
g
S
D
Q
H
"
m
f
l
s
T
F
d
W
p
x
-
9
n
\194
A
8
1
E
u
N
h
,
i
B
L
K
y
'
\195
4
r
5
q
x
t
6
w
V
o
7
T
b
A
:
c
P
O
d
g
8
N
Q
M
X
r
p
D
\194
E
F
z
f
.
9
U
"
?
!
K
;
u
'
y
C
-
A
f
4
v
5
o
S
m
V
u
7
b
H
x
g
d
1
.
F
r
Z
\195
G
p
0
a
Y
P
2
'
3
w
h
J
X
q
Q
z
-
y
J
f
Y
F
j
c
V
a
g
N
v
K
R
A
u
,
i
U
l
L
\194
t
b
B
'
O
X
x
$
p
0
.
7
D
\195
\163
M
e
Non-zero for 13.5% of words.
^45- year-old$____(2.8) ^next- generation$____ (2.7) ^34- year-old$____(2.6) ^24- year-old$____(2.5) ^24- hour$____(2.5) ^35- year-old$____(2.5) ^25- year-old$____(2.4) ^14- year-old$____(2.4) ^37- year-old$____(2.3) ^15- year-old$____(2.3) ^40- year-old$____(2.3) ^27- year-old$____(2.2) ^31- year-old$____(2.2) ^21- year-old$____(2.1) ^PAR IS$____(2.1) ^17- year-old$____(2.1) ^NASCAR $____ (2.1) ^50- year-old$____(2.0) ^1964$ ____ (2.0) ^30- year-old$____(2.0) ^30- year$____(2.0) ^32- year-old$____(2.0) ^Excl uding$____ (2.0) ^NFC$ ____ (2.0) ^33- year-old$____(2.0) ^11- year-old$____(2.0) ^PC$ ____(2.0) ^1974$ ____ (2.0) ^EPA$ ____ (2.0) ^1965$ ____ (1.9) ^20- year-old$____(1.9) ^20- year$____(1.9) ^FAR C$____(1.9) ^22- year-old$____(1.9) ^36- year-old$____(1.9) ^23- year-old$____(1.9) ^ISLAM ABAD$____ (1.9) ^1975$ ____ (1.9) ^38- year-old$____(1.9) ^QC$ ____(1.9) Filter 108 (bias = -0.49) #
K
-
L
v
A
j
U
k
/
p
O
g
Q
f
N
i
z
e
,
b
<EOS>
x
W
u
G
h
a
d
X
'
Z
m
5
n
Y
F
8
q
^
r
Z
j
?
E
!
v
n
S
y
T
.
p
m
e
/
l
U
t
C
i
c
J
L
h
M
x
K
;
w
k
A
-
&
I
(
0
)
q
Q
O
W
d
Z
E
X
P
n
u
/
D
w
p
V
e
N
j
Q
r
Y
-
K
T
5
s
$
F
B
\163
\194
v
4
f
g
t
y
c
c
-
T
u
t
f
D
b
S
P
0
\195
W
x
C
r
O
a
M
l
G
J
5
d
K
n
\194
s
X
m
g
L
Q
R
E
U
A
H
"
i
Non-zero for 20.6% of words.
^Lync h$____ (3.1) ^Knic ks$____ (2.3) ^Ann$ ____ (2.2) ^ATLANT A$____ (2.1) ^AZUZ$_ ___ (2.1) ^Lanc ashire$____ (2.0) ^Lanc e$____ (2.0) ^Lanc aster$____ (2.0) ^Anne $____ (2.0) ^Zac h$____(2.0) ^Anit a$____ (1.9) ^Unt il$____(1.9) ^Anot her$____ (1.9) ^Kyot o$____ (1.8) ^Amne sty$____ (1.8) ^Anc elotti$____(1.8) ^Anc horage$____(1.8) ^Unit ed$____ (1.7) ^Unit e$____ (1.7) ^Unit $____ (1.7) ^Ayat ollah$____ (1.7) ^UAW$ ____ (1.7) ^An$_ ___ (1.6) ^Ky$_ ___ (1.6) ^Anni e$____ (1.6) ^DALLAS $____ (1.6) ^Nanc y$____ (1.6) ^U.K.$_ ___ (1.6) ^K.$_ ___ (1.6) ^synt hetic$____ (1.5) ^L.$_ ___ (1.5) ^N.F.L.$_ ___ (1.5) ^organic $____ (1.5) ^panic $____ (1.5) ^Hispanic $____ (1.5) ^mechanic al$____ (1.5) ^guardian.c o.uk$____ (1.5) ^A.$_ ___ (1.5) ^L.A.$_ ___ (1.5) ^McCann$ ____ (1.5) Filter 109 (bias = -0.64) #
J
c
X
y
Z
f
7
x
H
o
V
t
6
p
I
.
<BOS>
v
2
s
4
h
3
'
1
m
9
O
5
S
j
k
Y
\163
L
r
P
T
Q
a
t
G
o
b
W
P
-
L
v
g
M
A
f
r
\194
Z
w
z
i
)
u
V
'
U
,
\195
(
F
T
%
O
!
"
7
N
a
n
p
y
J
L
x
D
f
U
i
Y
p
T
n
G
v
m
h
Z
w
Q
W
z
k
r
4
X
S
/
s
\195
5
M
F
R
,
l
a
P
q
.
t
K
2
F
v
f
T
S
-
s
z
,
g
4
o
3
d
8
t
N
c
K
D
Z
l
5
Y
L
q
Q
0
U
u
y
.
P
\194
7
m
9
w
B
p
Non-zero for 22.8% of words.
^Hors e$____ (2.4) ^colorf ul$____ (2.3) ^Jobs $____ (2.3) ^HOUS TON$____ (2.3) ^MLS $____(2.1) ^Hous e$____ (2.1) ^Hous ton$____ (2.1) ^Hous ing$____ (2.1) ^EDF $____(2.1) ^Home $____ (2.0) ^Home land$____ (2.0) ^Home s$____ (2.0) ^Holy $____ (2.0) ^Jers ey$____ (2.0) ^Horn ets$____ (2.0) ^Horn $____ (2.0) ^majors $____ (1.9) ^July $____ (1.9) ^N.F .L.$____(1.9) ^Jour nal$____ (1.8) ^Pors che$____ (1.8) ^IMF $____(1.8) ^EBITDA $____ (1.8) ^colors $____ (1.8) ^sailors $____ (1.8) ^councillors $____ (1.8) ^ISLA MABAD$____ (1.7) ^Joli e$____ (1.7) ^Half $____ (1.7) ^HMRC $____ (1.7) ^PHILA DELPHIA$____ (1.7) ^Golf $____ (1.6) ^Jim$ ____ (1.6) ^poultry $____ (1.6) ^Norf olk$____ (1.6) ^try ing$____(1.6) ^try $____(1.6) ^Perf ormance$____ (1.5) ^donors $____ (1.5) ^governors $____ (1.5) Filter 110 (bias = -0.39) #
J
y
E
m
9
p
<BOS>
f
2
'
7
d
4
s
N
c
B
P
I
U
0
o
q
O
j
x
X
,
5
C
3
t
6
-
R
l
H
.
8
z
U
p
R
g
u
d
B
l
k
x
M
e
"
j
&
G
Q
h
Y
L
N
y
Z
D
;
S
)
c
9
0
\194
5
(
%
'
\$
W
\163
H
F
A
f
q
F
a
y
Y
M
w
P
I
p
z
m
W
c
l
S
H
K
.
'
v
e
b
s
\195
C
B
j
u
O
J
D
\194
,
2
r
0
d
Q
p
"
t
8
m
N
k
9
g
3
z
X
l
7
c
W
i
4
n
E
v
6
C
Y
f
Z
w
$
s
2
A
H
d
M
'
R
x
\194
a
Non-zero for 12.3% of words.
^ERA$ ____ (3.7) ^TEHRAN $____ (3.6) ^IRA$ ____ (3.3) ^NBA$ ____ (3.2) ^Juar ez$____ (3.0) ^FEMA$ ____ (3.0) ^FRAN CISCO$____ (2.8) ^UAW $____(2.7) ^UAE $____(2.6) ^Juve ntus$____ (2.6) ^EMI$ ____ (2.6) ^quar ter$____ (2.5) ^headquar ters$____ (2.5) ^squar e$____ (2.5) ^Squar e$____ (2.5) ^ETA$ ____ (2.5) ^July $____ (2.5) ^U.N .$____(2.5) ^RBI$ ____ (2.4) ^BAE $____(2.3) ^KABUL$ ____ (2.3) ^ATLANTA$ ____ (2.3) ^Euge ne$____ (2.2) ^Jacqui$ ____ (2.2) ^BMW$ ____ (2.2) ^BA$ ____(2.2) ^Nicaragua$ ____ (2.2) ^Antigua$ ____ (2.2) ^Xinhua$ ____ (2.1) ^Joshua$ ____ (2.1) ^juve nile$____ (2.1) ^CITY$ ____ (2.1) ^\226\128\162$ ____ (2.1) ^Kabul$ ____ (2.0) ^Istanbul$ ____ (2.0) ^BEIJIN G$____ (2.0) ^June $____ (2.0) ^quo$ ____ (2.0) ^Rau l$____(1.9) ^But$ ____ (1.9) Filter 111 (bias = -0.55) #
-
A
<BOS>
L
'
h
Q
B
"
K
\194
E
R
5
d
a
I
y
P
F
v
S
^
4
u
G
r
c
U
x
3
0
w
g
M
p
N
d
H
-
Z
v
A
x
X
s
/
c
K
g
B
P
(
z
L
'
W
k
5
b
4
\163
Y
f
&
u
3
a
Q
r
w
G
6
C
g
y
z
f
I
M
Q
F
Y
K
l
L
j
m
V
n
\194
u
T
h
G
,
X
o
v
N
t
x
b
U
0
H
E
3
k
8
$
B
R
P
v
H
p
L
"
n
x
A
b
y
T
Z
k
u
\194
U
W
/
z
1
\163
h
c
m
Q
F
0
.
E
C
X
M
9
i
q
,
'
N
B
l
Non-zero for 10.8% of words.
^D-N.Y .$____ (3.6) ^T-Mob ile$____ (3.0) ^Mav ericks$____(2.9) ^Wal-Mar t$____ (2.8) ^WASHINGT ON$____ (2.7) ^Nav y$____(2.7) ^Nav al$____(2.7) ^Hav ing$____(2.6) ^Hav e$____(2.6) ^Hav en$____(2.6) ^Hav ana$____(2.6) ^10-K$_ ___ (2.6) ^Nev ada$____(2.6) ^Nev ertheless$____(2.6) ^Nev er$____(2.6) ^Nev ille$____(2.6) ^al-Mal iki$____ (2.5) ^HIV $____(2.5) ^Age ncy$____(2.4) ^Age $____(2.4) ^Age nt$____(2.4) ^Adv anced$____(2.4) ^Adv isory$____(2.4) ^Adv ertising$____(2.4) ^Alb ert$____(2.4) ^Alb erto$____(2.4) ^Alb any$____(2.4) ^Alb erta$____(2.4) ^NAT O$____(2.4) ^NEW $____(2.3) ^MTV $____(2.3) ^Aze rbaijan$____(2.3) ^AIG $____(2.3) ^NYS E$____(2.3) ^MI5 $____(2.2) ^Xav ier$____(2.2) ^RBI$ ____ (2.2) ^MVP $____(2.2) ^Alz heimer$____(2.2) ^Abb as$____(2.2) Filter 112 (bias = -0.45) #
u
x
R
p
H
m
r
K
t
b
E
f
I
l
T
L
-
V
N
v
U
z
Q
G
"
6
j
5
O
B
<BOS>
X
M
n
D
P
s
c
Y
a
K
F
W
j
z
g
X
k
&
r
Y
n
"
t
o
h
O
f
\194
C
G
H
Q
e
/
s
6
I
:
d
8
p
9
\$
L
u
!
i
w
-
S
-
G
u
Q
v
L
n
A
J
X
w
/
f
Y
k
l
q
O
r
K
R
C
o
z
i
5
\195
\194
j
V
e
$
N
M
H
B
y
v
O
j
r
J
Q
l
K
-
"
B
G
x
c
u
/
i
Z
s
N
k
p
b
W
0
X
t
$
F
.
f
8
V
A
6
e
\194
Non-zero for 12.5% of words.
^Holy $____ (2.7) ^TOKYO $____ (2.0) ^fully $____ (2.0) ^successfully $____ (2.0) ^carefully $____ (2.0) ^hopefully $____ (2.0) ^paraly zed$____ (2.0) ^proxy $____ (2.0) ^Kay $____(2.0) ^Italy $____ (1.9) ^Oly mpic$____(1.9) ^Oly mpics$____(1.9) ^Way ne$____(1.9) ^Way $____(1.9) ^firmly $____ (1.9) ^crazy $____ (1.9) ^holy $____ (1.9) ^prototy pe$____ (1.9) ^custody $____ (1.9) ^Sarkozy $____ (1.9) ^Buzz$ ____ (1.8) ^buzz$ ____ (1.8) ^Why $____(1.8) ^democracy $____ (1.7) ^conspiracy $____ (1.7) ^Tracy $____ (1.7) ^piracy $____ (1.7) ^proudly $____ (1.6) ^loudly $____ (1.6) ^Airway s$____ (1.6) ^Norway $____ (1.6) ^underway $____ (1.6) ^motorway $____ (1.6) ^previously $____ (1.6) ^seriously $____ (1.6) ^obviously $____ (1.6) ^Obviously $____ (1.6) ^fantasy $____ (1.6) ^Rory $____ (1.6) ^Key $____(1.5) Filter 113 (bias = -0.47) #
s
x
E
q
O
B
D
p
z
b
-
h
U
k
G
n
u
a
S
H
<BOS>
V
j
W
M
X
t
f
o
i
d
7
R
9
T
v
.
6
c
8
w
h
2
c
3
v
I
d
K
y
J
.
Z
x
E
T
X
u
5
D
V
t
4
q
W
m
Q
p
6
r
9
o
O
l
7
g
i
?
G
k
Y
f
h
F
o
P
l
I
g
n
G
e
S
U
O
w
T
2
A
Z
\194
k
L
d
W
s
"
B
$
E
0
9
m
N
-
a
u
g
P
.
f
G
B
"
p
Q
k
W
l
E
i
$
n
Z
r
X
,
\194
t
w
\195
4
J
c
T
Y
U
S
I
8
F
a
K
R
Non-zero for 18.1% of words.
^TOKYO $____ (3.1) ^two$ ____ (2.5) ^Bowl$ ____ (2.5) ^bowl$ ____ (2.5) ^Two$ ____ (2.4) ^fossil$ ____ (2.3) ^Dwig ht$____ (2.3) ^DETROIT$ ____ (2.3) ^Vog ue$____(2.2) ^nowhe re$____ (2.2) ^Gigg s$____ (2.2) ^Picasso$ ____ (2.2) ^two- thirds$____ (2.2) ^two- year$____ (2.2) ^two- run$____ (2.2) ^two- day$____ (2.2) ^Oil$ ____ (2.0) ^Bush$ ____ (2.0) ^push$ ____ (2.0) ^rush$ ____ (2.0) ^Rush$ ____ (2.0) ^brush$ ____ (2.0) ^crush$ ____ (2.0) ^ambush$ ____ (2.0) ^Brazil$ ____ (2.0) ^Volkswag en$____ (2.0) ^Assoc iation$____ (2.0) ^Assoc iated$____ (2.0) ^assoc iated$____ (2.0) ^assoc iation$____ (2.0) ^assoc iate$____ (2.0) ^assoc iates$____ (2.0) ^Assoc iates$____ (2.0) ^assoc iations$____ (2.0) ^Jo$ ____(1.9) ^Il$ ____(1.9) ^spokeswom an$____ (1.9) ^Ipswic h$____ (1.9) ^ANGELE S$____ (1.9) ^NY$ ____(1.9) Filter 114 (bias = -0.52) #
Q
k
X
f
<BOS>
g
\194
w
7
m
6
r
"
t
/
c
^
p
8
i
Y
o
W
u
h
v
n
-
y
e
j
s
p
u
e
Y
r
\194
F
o
b
i
E
t
g
&
\163
/
P
U
G
T
\$
H
f
v
?
B
x
n
!
R
Q
,
j
(
y
M
c
W
8
l
Y
p
V
f
\194
y
H
c
$
o
j
P
X
K
Q
d
4
r
R
O
u
a
e
x
D
N
\163
,
F
n
w
w
f
Z
x
z
p
g
F
G
h
M
l
.
P
c
S
W
,
X
r
$
d
V
o
U
i
T
j
Q
y
Y
L
E
a
m
e
B
s
Non-zero for 24.6% of words.
^puz zle$____(3.1) ^rug by$____(2.9) ^rug ged$____(2.9) ^egg s$____(2.9) ^egg $____(2.9) ^piz za$____(2.8) ^puc k$____(2.8) ^pig s$____(2.8) ^pig $____(2.8) ^eig ht$____(2.6) ^eig hth$____(2.6) ^eig ht-year$____(2.6) ^buz z$____(2.6) ^bug $____(2.6) ^rig ht$____(2.6) ^rig hts$____(2.6) ^rig ht-wing$____(2.6) ^rig orous$____(2.6) ^Eug ene$____(2.6) ^Fig ures$____(2.5) ^Fig hting$____(2.5) ^pic k$____(2.5) ^pic ture$____(2.5) ^pic ked$____(2.5) ^pic tures$____(2.5) ^Ipsw ich$____ (2.5) ^biz arre$____(2.3) ^buc k$____(2.3) ^buc ket$____(2.3) ^big $____(2.3) ^big gest$____(2.3) ^big ger$____(2.3) ^ric h$____(2.3) ^ric e$____(2.3) ^ric hest$____(2.3) ^ric her$____(2.3) ^pum p$____(2.3) ^pum ped$____(2.3) ^pum ping$____(2.3) ^pum ps$____(2.3) Filter 115 (bias = -0.48) #
T
.
C
f
k
x
z
L
R
e
Y
F
U
h
G
l
0
r
v
y
V
N
i
a
\194
b
1
q
9
A
I
m
c
n
J
-
W
d
"
p
C
v
g
x
n
a
Z
B
j
E
V
b
M
o
c
u
H
f
?
%
!
q
/
U
A
;
G
W
'
T
F
\195
5
z
)
J
(
"
X
d
b
D
Q
1
V
i
B
d
'
T
"
g
x
-
X
y
s
o
\194
t
U
p
Z
h
K
H
S
u
.
c
F
j
R
0
Y
e
/
2
$
w
o
H
x
F
G
u
z
M
l
e
Y
t
S
y
p
T
O
m
0
U
5
k
K
Z
\194
D
9
f
c
j
s
q
R
P
7
I
8
E
J
r
Non-zero for 25.2% of words.
^Robinso n$____ (2.9) ^Wilkinso n$____ (2.9) ^Parkinso n$____ (2.9) ^Tomlinso n$____ (2.9) ^onbo ard$____ (2.9) ^Unfo rtunately$____ (2.9) ^Abo ut$____(2.9) ^backgro und$____ (2.8) ^backgro unds$____ (2.8) ^info rmation$____ (2.8) ^info rmed$____ (2.8) ^info rm$____ (2.8) ^info rmal$____ (2.8) ^reinfo rced$____ (2.8) ^Xbo x$____(2.7) ^CBS $____(2.6) ^Craigsl ist$____ (2.5) ^Info rmation$____ (2.5) ^Clo se$____(2.5) ^Clo oney$____(2.5) ^CHICAG O$____ (2.5) ^Lisbo n$____ (2.4) ^Citigro up$____ (2.4) ^Wimbl edon$____ (2.3) ^Tyso n$____ (2.3) ^glo bal$____(2.3) ^glo be$____(2.3) ^glo bally$____(2.3) ^glo ry$____(2.3) ^Cro wn$____(2.3) ^Cro ss$____(2.3) ^Cro atia$____(2.3) ^Cro sby$____(2.3) ^Cro wley$____(2.3) ^insp ired$____ (2.2) ^insp ectors$____ (2.2) ^insp iration$____ (2.2) ^insp ection$____ (2.2) ^Redknap p$____ (2.2) ^unfo rtunate$____ (2.2) Filter 116 (bias = -0.59) #
R
.
C
h
G
m
Q
-
K
q
9
e
U
g
O
x
z
v
8
t
3
b
<BOS>
H
S
f
P
l
5
y
7
u
"
w
Y
d
<EOS>
n
0
a
D
f
H
w
Y
k
L
s
y
x
T
B
X
v
1
b
h
E
d
-
G
n
/
a
l
I
m
N
!
F
&
e
\194
R
7
'
M
J
Z
9
.
i
c
P
g
f
G
p
D
H
j
u
Q
,
A
B
Z
J
z
k
F
o
E
1
C
3
S
\195
e
a
t
U
w
R
0
W
\163
y
X
h
w
d
K
D
V
.
W
c
M
u
S
l
4
h
3
r
E
T
i
q
5
v
2
a
f
y
Z
L
J
t
B
A
X
C
s
g
6
p
e
-
Non-zero for 13.7% of words.
^D.$ ____(2.8) ^1.4 $____(2.8) ^1.3 $____(2.8) ^G.M .$____(2.7) ^1.5 $____(2.6) ^1.2 $____(2.5) ^1.2 5$____(2.5) ^ISLAM ABAD$____ (2.4) ^Hew itt$____(2.4) ^H.$ ____(2.4) ^CDC$ ____ (2.4) ^7.4 $____(2.4) ^Lew is$____(2.3) ^L.$ ____(2.3) ^6.4 $____(2.3) ^6.3 $____(2.3) ^Dwi ght$____(2.2) ^1.6 $____(2.2) ^Def ense$____(2.2) ^Def ence$____(2.2) ^Def ending$____(2.2) ^7.5 $____(2.2) ^T.$ ____(2.2) ^0.4 $____(2.1) ^CDs$ ____ (2.1) ^0.3 $____(2.1) ^LAS $____(2.1) ^1.9 $____(2.1) ^7.2 $____(2.1) ^DAX $____(2.1) ^6.5 $____(2.1) ^6.2 $____(2.0) ^USDA$ ____ (2.0) ^UCLA$ ____ (2.0) ^Ryde r$____ (2.0) ^8.5 $____(2.0) ^Des pite$____(2.0) ^Des ign$____(2.0) ^Des $____(2.0) ^Des ert$____(2.0) Filter 117 (bias = -0.53) #
K
h
Z
t
8
T
5
u
Q
-
X
k
6
r
L
g
G
q
/
v
2
H
9
j
U
i
3
l
<BOS>
o
7
p
<EOS>
b
V
.
z
R
4
d
i
c
H
.
I
m
4
K
E
v
2
o
1
b
3
M
W
z
h
?
u
'
7
n
s
r
6
D
S
Z
:
f
,
G
a
C
\$
N
d
B
X
x
M
a
Z
n
Q
f
J
t
Y
s
G
p
V
i
"
h
j
o
P
,
E
c
b
v
$
A
r
d
D
C
T
y
L
k
\195
w
7
S
y
j
W
J
u
S
H
g
U
r
"
p
m
l
a
R
T
f
.
I
q
-
h
G
\194
s
Y
n
Q
F
d
5
/
0
X
P
$
o
L
C
Non-zero for 18.8% of words.
^Liby a$____ (3.4) ^Liby an$____ (3.4) ^Kiba ki$____ (3.2) ^Ki-m oon$____ (2.9) ^Lieu tenant$____ (2.7) ^Kiev $____ (2.6) ^Kim$ ____ (2.6) ^JERUSALEM$ ____ (2.6) ^Hey $____(2.6) ^Gira rdi$____ (2.6) ^iTu nes$____(2.5) ^Kabu l$____ (2.4) ^Kirc hner$____ (2.3) ^Lib$ ____ (2.3) ^simu ltaneously$____ (2.2) ^dairy $____ (2.2) ^iPa d$____(2.2) ^Gary $____ (2.2) ^24-y ear-old$____ (2.1) ^34-y ear-old$____ (2.1) ^desira ble$____ (2.1) ^Pira tes$____ (2.1) ^iPh one$____(2.1) ^iPh ones$____(2.1) ^imm ediately$____(2.1) ^imm ediate$____(2.1) ^imm igration$____(2.1) ^imm igrants$____(2.1) ^easily $____ (2.1) ^22-y ear-old$____ (2.0) ^AZUZ$ ____ (2.0) ^Kara chi$____ (2.0) ^Kara dzic$____ (2.0) ^Figu res$____ (2.0) ^Ligh t$____ (2.0) ^32-y ear-old$____ (2.0) ^firm $____ (1.9) ^confirm ed$____ (1.9) ^firm s$____ (1.9) ^confirm $____ (1.9) Filter 118 (bias = -0.49) #
<BOS>
p
X
d
Q
-
M
i
Z
s
"
u
V
f
N
o
W
x
Y
y
^
n
\194
a
B
l
8
v
9
m
K
P
4
z
7
g
6
t
c
c
I
L
Q
m
W
v
e
z
2
U
E
o
p
u
q
)
;
x
H
B
X
C
:
D
O
G
r
s
\$
l
\163
0
w
Y
P
M
t
h
i
Y
f
X
y
V
P
\194
t
Q
p
7
F
b
n
l
r
G
u
g
U
$
k
.
e
"
o
W
,
6
c
L
i
d
s
-
K
Y
-
L
j
8
g
U
f
K
k
"
v
Q
w
y
p
O
e
W
m
N
n
/
b
A
t
1
'
3
i
,
J
9
r
D
I
6
P
h
F
Non-zero for 23.3% of words.
^McQu een$____ (2.4) ^McLa ren$____ (2.4) ^clo se$____(2.4) ^clo sed$____(2.4) ^clo sing$____(2.4) ^clo ser$____(2.4) ^cla ims$____(2.3) ^cla imed$____(2.3) ^cla im$____(2.3) ^cla ss$____(2.3) ^cla iming$____(2.3) ^cla ssic$____(2.3) ^cla sses$____(2.3) ^cla shes$____(2.3) ^L.A .$____(2.2) ^Llo yds$____(2.2) ^Llo yd$____(2.2) ^Muba rak$____ (2.2) ^LSU $____(2.2) ^U.K .$____(2.1) ^LG$ ____(2.1) ^MLS$ ____ (2.1) ^DVD $____(2.1) ^DVD s$____(2.1) ^Moga dishu$____ (2.1) ^Nobo dy$____ (2.0) ^Muga be$____ (2.0) ^cho ice$____(2.0) ^cho ose$____(2.0) ^cho sen$____(2.0) ^cho se$____(2.0) ^cho ices$____(2.0) ^cho colate$____(2.0) ^cho osing$____(2.0) ^L.$ ____(2.0) ^U.N .$____(2.0) ^cha nge$____(2.0) ^cha rges$____(2.0) ^cha nce$____(2.0) ^cha rged$____(2.0) Filter 119 (bias = -0.47) #
L
k
5
v
D
B
e
'
4
R
2
T
7
b
6
z
H
t
j
f
F
c
E
p
1
o
d
U
3
C
Z
r
X
m
8
<BOS>
l
u
A
x
y
-
h
w
Y
t
"
I
8
l
Q
J
4
v
V
z
S
j
W
.
C
e
G
\195
H
E
7
o
3
n
x
u
1
q
?
%
6
T
Z
N
F
-
8
g
9
o
Q
m
X
u
B
w
7
t
6
i
N
z
P
s
5
.
4
y
V
c
K
O
2
G
Z
T
3
l
f
d
"
h
b
'
I
f
W
F
O
u
Q
h
2
b
z
x
X
.
p
M
1
j
i
m
G
v
$
r
w
L
,
-
K
n
e
B
k
c
N
Non-zero for 12.5% of words.
^787$ ____ (2.5) ^eye$ ____ (2.4) ^Eye$ ____ (2.2) ^ECB$ ____ (2.1) ^LLP$ ____ (2.1) ^147$ ____ (2.0) ^1982 $____ (2.0) ^1992 $____ (2.0) ^CBI $____(1.9) ^LCD$ ____ (1.9) ^rehea rsal$____ (1.9) ^HSBC $____ (1.9) ^PHILADELPHI A$____ (1.8) ^Lync h$____ (1.8) ^Lib$ ____ (1.8) ^surveyed $____ (1.8) ^1972 $____ (1.8) ^1962 $____ (1.8) ^HBO S$____(1.8) ^HBO $____(1.8) ^1981 $____ (1.8) ^keybo ard$____ (1.8) ^1991 $____ (1.8) ^H1N1 $____ (1.8) ^88$ ____(1.7) ^89$ ____(1.7) ^Lynn $____ (1.7) ^FBI $____(1.7) ^145$ ____ (1.6) ^Lhas a$____ (1.6) ^1971 $____ (1.6) ^eyes $____ (1.6) ^Reyes $____ (1.6) ^1983 $____ (1.6) ^1961 $____ (1.6) ^1993 $____ (1.6) ^1985 $____ (1.6) ^FOX$ ____ (1.6) ^LOND ON$____ (1.6) ^LSU$ ____ (1.6) Filter 120 (bias = -0.47) #
<BOS>
h
Q
a
'
x
R
l
C
L
Z
A
V
q
M
T
"
B
^
y
-
p
/
v
I
o
X
d
i
b
e
H
D
t
d
k
D
r
l
h
L
R
z
E
.
H
!
S
0
Y
6
W
c
u
n
"
v
i
C
O
X
N
/
;
\194
?
5
3
m
M
P
y
Z
o
X
y
Q
s
b
t
J
c
Z
i
7
o
q
u
9
h
N
S
\195
U
V
C
6
k
L
f
I
m
8
T
2
O
l
,
r
p
e
'
B
d
Q
w
r
c
"
m
7
t
X
z
R
v
8
U
9
n
P
i
Y
A
b
B
$
T
N
s
j
C
\194
k
S
g
3
y
_
M
u
Non-zero for 33.5% of words.
^groundbr eaking$____ (3.7) ^der ivatives$____(3.6) ^der $____(3.6) ^der ived$____(3.6) ^der by$____(3.6) ^Der by$____(3.5) ^Der ek$____(3.5) ^Der byshire$____(3.5) ^Der rick$____(3.5) ^Clar k$____ (3.4) ^Clar ke$____ (3.4) ^Clar kson$____ (3.4) ^Clar ence$____ (3.4) ^DJ$ ____(3.2) ^'ll$ ____ (3.2) ^dar k$____(3.2) ^dar kness$____(3.2) ^dar e$____(3.2) ^dar ing$____(3.2) ^Dar ling$____(3.1) ^Dar fur$____(3.1) ^Dar ren$____(3.1) ^Dar win$____(3.1) ^Dar k$____(3.1) ^lar ge$____(3.1) ^lar gest$____(3.1) ^lar gely$____(3.1) ^lar ger$____(3.1) ^zer o$____(3.0) ^second-lar gest$____ (3.0) ^third-lar gest$____ (3.0) ^Chrysler $____ (3.0) ^under $____ (3.0) ^under stand$____ (3.0) ^Under $____ (3.0) ^commander $____ (3.0) ^order $____ (2.9) ^border $____ (2.9) ^murder $____ (2.9) ^order ed$____ (2.9) Filter 121 (bias = -0.58) #
.
r
Z
i
\194
p
c
J
M
I
Q
l
m
j
"
R
X
P
<BOS>
E
/
h
'
k
W
\195
^
H
8
o
v
g
y
A
V
t
6
O
K
T
Q
s
q
f
?
J
r
)
W
j
!
m
"
M
:
v
a
%
(
u
X
-
I
F
A
n
;
L
O
i
H
5
N
U
\163
0
.
x
&
o
p
u
x
A
8
H
G
t
9
U
"
w
c
.
Q
m
7
l
X
T
0
L
6
i
K
I
P
n
5
-
S
h
V
a
\194
B
3
M
O
\195
f
g
F
T
B
D
V
c
k
d
9
h
s
o
K
y
P
.
b
t
Q
-
S
u
x
1
R
q
,
v
Z
G
8
l
3
0
U
H
4
O
Non-zero for 14.4% of words.
^Oxf ord$____(3.0) ^Oxf ordshire$____(3.0) ^caps $____ (3.0) ^maps $____ (2.9) ^Ashcrof t$____ (2.5) ^Ips wich$____(2.5) ^Mack $____ (2.3) ^McCormack $____ (2.2) ^cap$ ____ (2.2) ^Max$ ____ (2.1) ^map$ ____ (2.1) ^aff ected$____(2.1) ^aff ect$____(2.1) ^aff ord$____(2.1) ^aff airs$____(2.1) ^swaps $____ (2.0) ^capa city$____ (2.0) ^capa ble$____ (2.0) ^capa bilities$____ (2.0) ^capa bility$____ (2.0) ^escape $____ (2.0) ^escape d$____ (2.0) ^landscape $____ (2.0) ^capi tal$____ (1.9) ^capi talism$____ (1.9) ^escapi ng$____ (1.9) ^capi ta$____ (1.9) ^ref orm$____(1.9) ^ref used$____(1.9) ^ref lect$____(1.9) ^ref orms$____(1.9) ^ref erring$____(1.9) ^ref erred$____(1.9) ^ref erendum$____(1.9) ^ack nowledged$____(1.9) ^ack nowledge$____(1.9) ^ack nowledges$____(1.9) ^ack nowledging$____(1.9) ^Aff airs$____(1.9) ^aircraf t$____ (1.8) Filter 122 (bias = -0.62) #
B
d
W
p
9
-
V
O
Z
t
b
r
X
y
Y
s
4
D
6
P
q
j
N
g
w
l
v
f
J
S
8
c
3
\163
5
'
H
e
n
G
Y
e
R
F
O
x
o
L
"
f
&
d
Q
b
;
a
T
.
'
v
r
j
(
n
C
l
\194
w
S
q
G
m
/
\$
6
E
2
G
t
K
w
Y
H
L
q
o
n
O
k
J
.
z
u
R
I
l
v
9
-
8
e
S
g
0
d
5
i
3
F
P
T
\195
h
x
m
7
j
N
T
7
k
3
U
X
v
Q
m
5
t
8
z
2
u
4
c
9
s
e
C
6
i
Z
B
j
d
J
'
r
y
n
D
q
p
L
_
-
P
Non-zero for 14.9% of words.
^HBOS$ ____ (3.3) ^Bron x$____ (3.3) ^Bron cos$____ (3.3) ^LeBron $____ (3.3) ^Brow n$____ (2.8) ^Brow ns$____ (2.8) ^Brow ne$____ (2.8) ^Bosn ia$____ (2.8) ^Bosn ian$____ (2.8) ^Box$ ____ (2.7) ^Born $____ (2.7) ^Broo klyn$____ (2.7) ^Broo ks$____ (2.7) ^Broo ke$____ (2.7) ^Broo k$____ (2.7) ^Boll ywood$____ (2.7) ^Bob$ ____ (2.5) ^KABUL$ ____ (2.5) ^BCS$ ____ (2.5) ^Zoo$ ____ (2.4) ^Boar d$____ (2.3) ^BAGH DAD$____ (2.3) ^Broa dway$____ (2.2) ^Broa d$____ (2.2) ^Broa dcasting$____ (2.2) ^Boro ugh$____ (2.2) ^NYSE $____ (2.1) ^Yor k$____(2.1) ^Yor kshire$____(2.1) ^Yor k-based$____(2.1) ^Yor kers$____(2.1) ^Yor ker$____(2.1) ^boxe s$____ (2.1) ^boxe r$____ (2.1) ^Yon hap$____(2.1) ^afternoon $____ (2.1) ^noon $____ (2.1) ^Bobb y$____ (2.0) ^Ror y$____(2.0) ^Boy$ ____ (2.0) Filter 123 (bias = -0.37) #
j
z
6
o
X
U
F
A
e
O
7
s
M
a
4
k
8
K
d
r
H
R
\194
w
D
p
Z
B
<BOS>
t
V
<EOS>
2
G
5
i
Q
,
9
c
w
h
W
d
K
L
k
l
T
x
z
u
E
F
B
D
;
n
X
j
O
7
t
\$
Q
1
M
C
I
8
"
-
V
H
m
y
v
s
U
.
W
m
K
u
O
g
Q
.
9
j
,
-
"
h
B
L
I
F
z
H
o
d
p
M
2
D
3
e
X
l
N
y
8
n
R
b
S
Z
\194
A
X
f
l
y
6
k
J
r
7
F
\194
c
5
t
0
h
2
U
L
s
-
p
I
m
z
'
Y
S
1
u
Z
M
9
B
V
x
D
C
/
,
Non-zero for 15.8% of words.
^renewal $____ (2.7) ^sidewal k$____ (2.7) ^goodwil l$____ (2.2) ^newsl etter$____ (2.1) ^wal k$____(2.1) ^wal ked$____(2.1) ^wal l$____(2.1) ^wal king$____(2.1) ^Wol f$____(1.9) ^Wol ves$____(1.9) ^marketpl ace$____ (1.8) ^wil l$____(1.8) ^wil ling$____(1.8) ^wil d$____(1.8) ^wil dlife$____(1.8) ^jewel ry$____ (1.8) ^jewel lery$____ (1.8) ^farewel l$____ (1.8) ^Cornwal l$____ (1.8) ^statewid e$____ (1.8) ^Wal l$____(1.7) ^Wal es$____(1.7) ^Wal ker$____(1.7) ^Wal -Mart$____(1.7) ^demol ition$____ (1.6) ^demol ished$____ (1.6) ^metal $____ (1.6) ^metal s$____ (1.6) ^retal iation$____ (1.6) ^empl oyees$____ (1.6) ^unempl oyment$____ (1.6) ^empl oyee$____ (1.6) ^empl oyment$____ (1.6) ^empl oyers$____ (1.6) ^revol ution$____ (1.6) ^evol ution$____ (1.6) ^Revol ution$____ (1.6) ^Revol utionary$____ (1.6) ^sewag e$____ (1.6) ^unwil ling$____ (1.6) Filter 124 (bias = -0.44) #
C
m
7
w
R
b
S
.
Q
f
j
a
D
v
I
y
8
B
5
q
0
-
G
W
9
u
F
x
\194
M
4
o
1
e
P
K
3
h
6
n
d
b
t
k
l
B
D
r
\194
f
T
K
-
Z
u
x
&
F
(
V
:
w
I
)
/
N
H
9
Y
?
O
G
i
n
1
p
.
P
\195
X
f
E
k
2
R
e
u
Q
o
W
'
I
n
6
-
5
x
A
r
4
m
D
s
7
v
g
U
L
C
\163
B
H
b
G
\195
1
c
q
i
W
u
X
s
p
A
"
t
Q
U
K
F
b
D
V
j
x
L
q
H
v
l
G
h
Y
R
w
d
\194
C
9
_
8
n
6
r
$
.
Z
I
Non-zero for 19.0% of words.
^Step hen$____ (3.0) ^Step hanie$____ (3.0) ^Clev eland$____ (2.7) ^Stev e$____ (2.4) ^Stev en$____ (2.4) ^Stev ens$____ (2.4) ^Stev enson$____ (2.4) ^NASDAQ $____ (2.4) ^dep artment$____(2.4) ^dep uty$____(2.4) ^dep arture$____(2.4) ^dep loyed$____(2.4) ^dep ression$____(2.4) ^dep ending$____(2.4) ^Stew art$____ (2.3) ^Stap les$____ (2.2) ^Clem ens$____ (2.2) ^Clip pers$____ (2.1) ^Clea rly$____ (2.1) ^Clea n$____ (2.1) ^step $____ (2.1) ^step s$____ (2.1) ^step ped$____ (2.1) ^step ping$____ (2.1) ^DAX $____(2.1) ^Dep artment$____(2.1) ^Dep uty$____(2.1) ^Dep ression$____(2.1) ^Dep osit$____(2.1) ^deb t$____(2.0) ^deb ate$____(2.0) ^deb ut$____(2.0) ^deb ts$____(2.0) ^deb ris$____(2.0) ^slep t$____ (2.0) ^Queb ec$____ (2.0) ^CO2$ ____ (1.9) ^USDA$ ____ (1.9) ^Cleg g$____ (1.9) ^Shep herd$____ (1.9) Filter 125 (bias = -0.43) #
Q
j
"
l
W
J
U
b
y
f
8
k
c
h
/
i
O
-
Z
g
^
x
2
B
X
m
K
v
G
p
\194
F
r
e
n
t
-
A
d
B
p
U
X
k
2
m
0
h
7
s
P
f
D
t
e
u
G
L
\163
n
6
F
1
H
E
a
j
C
Q
,
I
%
O
S
8
b
y
j
/
b
1
F
n
E
W
k
,
S
a
e
Z
f
K
v
C
J
A
r
U
g
H
x
D
s
2
-
L
R
X
V
w
h
d
B
o
l
Y
t
V
p
b
I
m
d
'
N
k
D
"
e
Z
f
M
n
G
,
$
A
\194
a
R
o
Q
F
v
l
U
y
u
q
X
1
z
2
-
L
Non-zero for 25.5% of words.
^dam age$____(2.2) ^dam aged$____(2.2) ^dam aging$____(2.2) ^dam ages$____(2.2) ^2-1$ ____ (2.2) ^already$ ____ (2.2) ^ready$ ____ (2.2) ^steady$ ____ (2.2) ^lady$ ____ (2.2) ^1-1$ ____ (2.1) ^Saddam $____ (2.0) ^da$ ____(2.0) ^adam ant$____ (2.0) ^hadn$ ____ (1.9) ^3-1$ ____ (1.9) ^dom estic$____(1.9) ^dom inated$____(1.9) ^dom inant$____(1.9) ^dom inate$____(1.9) ^6-1$ ____ (1.9) ^therapy$ ____ (1.9) ^chemotherapy$ ____ (1.9) ^Wen$ ____ (1.8) ^2-2$ ____ (1.8) ^dau ghter$____(1.8) ^dau ghters$____(1.8) ^dau nting$____(1.8) ^yen$ ____ (1.8) ^capab le$____ (1.8) ^capab ilities$____ (1.8) ^capab ility$____ (1.8) ^4-1$ ____ (1.8) ^daz zling$____(1.8) ^5-1$ ____ (1.7) ^Canada$ ____ (1.7) ^Nevada$ ____ (1.7) ^Posada$ ____ (1.7) ^spy$ ____ (1.7) ^pa$ ____(1.7) ^study$ ____ (1.7) Filter 126 (bias = -0.49) #
-
F
Y
B
O
f
g
v
G
k
o
e
p
U
y
j
Q
E
"
J
r
t
'
9
^
b
.
P
W
x
M
6
s
u
N
S
y
v
n
\194
?
j
L
E
Z
s
c
;
H
I
A
Q
h
k
N
-
m
R
.
z
!
"
1
V
K
t
D
'
M
Y
r
l
o
J
a
i
F
H
.
-
f
w
c
Y
x
W
D
u
l
1
L
3
t
U
N
J
e
2
S
I
r
4
j
Z
h
V
d
R
A
G
b
\195
v
$
B
X
c
W
-
Q
C
E
g
N
p
2
'
6
k
H
s
K
n
L
f
B
x
"
m
q
v
3
j
8
o
I
d
4
u
7
i
/
h
Z
R
Non-zero for 12.7% of words.
^movie $____ (2.8) ^movie s$____ (2.8) ^Sovie t$____ (2.8) ^Movie $____ (2.8) ^Wachovia $____ (2.6) ^Sie rra$____(2.6) ^Sie mens$____(2.6) ^intervie w$____ (2.5) ^intervie ws$____ (2.5) ^intervie wed$____ (2.5) ^intervie wing$____ (2.5) ^Pelosi$ ____ (2.5) ^rookie $____ (2.4) ^cookie s$____ (2.4) ^vie w$____(2.4) ^vie ws$____(2.4) ^vie wers$____(2.4) ^vie wed$____(2.4) ^Nokia $____ (2.3) ^via $____(2.3) ^via ble$____(2.3) ^via bility$____(2.3) ^sie ge$____(2.3) ^Malaysia $____ (2.3) ^Malaysia n$____ (2.3) ^controversia l$____ (2.2) ^Persia n$____ (2.2) ^Jolie $____ (2.2) ^negotia tions$____ (2.2) ^negotia ting$____ (2.2) ^negotia te$____ (2.2) ^negotia ted$____ (2.2) ^negotia tors$____ (2.2) ^negotia tor$____ (2.2) ^liq uidity$____(2.1) ^liq uid$____(2.1) ^liq uor$____(2.1) ^THE $____(2.1) ^Vie tnam$____(2.0) ^Vie nna$____(2.0) Filter 127 (bias = -0.34) #
E
c
U
n
L
C
s
k
d
o
u
g
P
'
2
p
e
h
a
t
6
q
J
M
I
r
z
v
F
x
3
Y
D
w
4
V
l
N
8
B
U
f
D
p
R
w
Y
x
&
e
u
-
Z
t
C
i
Q
j
L
S
8
g
T
m
)
s
1
n
9
\$
H
v
7
.
"
F
/
o
\194
E
U
v
R
p
Q
x
s
q
S
d
/
h
Z
c
K
-
O
g
E
m
3
T
N
b
A
.
$
n
L
e
I
0
F
t
,
i
"
o
y
Q
f
X
t
Z
k
"
i
.
p
$
s
Y
F
-
B
\194
S
8
,
7
C
/
A
W
x
h
c
T
U
n
P
_
Non-zero for 12.6% of words.
^ERA$ ____ (3.1) ^EDF$ ____ (3.0) ^consensus$ ____ (2.6) ^versus$ ____ (2.6) ^Jesus$ ____ (2.6) ^census$ ____ (2.6) ^ETA$ ____ (2.5) ^JERUS ALEM$____ (2.4) ^DETRO IT$____ (2.4) ^AIDS$ ____ (2.3) ^IRS$ ____ (2.3) ^DIEGO$ ____ (2.3) ^BERLI N$____ (2.2) ^PCs$ ____ (2.2) ^Petraeus$ ____ (2.1) ^NASDAQ $____ (2.1) ^EU$_ ___ (2.0) ^ECB$ ____ (2.0) ^Us$ ____(2.0) ^UBS$ ____ (2.0) ^AZUZ $____ (2.0) ^IRA$ ____ (1.9) ^EPA$ ____ (1.9) ^DENVER$_ ___ (1.9) ^SOURCE $____ (1.9) ^DAX $____(1.9) ^US$ ____(1.9) ^CEOs$ ____ (1.9) ^LAS$ ____ (1.9) ^DALLAS$ ____ (1.9) ^FDA$ ____ (1.9) ^stimulus$ ____ (1.9) ^plus$ ____ (1.9) ^Plus$ ____ (1.9) ^surplus$ ____ (1.9) ^US- led$____(1.9) ^UAE$ ____ (1.8) ^LLC$ ____ (1.8) ^PRNewswire-USN ewswire$____ (1.8) ^misuse $____ (1.8)