This page shows visualizations of some width-3 1-d convolutional filters from Google's lm_1b language model. Each column corresponds to one position in the filter, and shows the characters with the most positive weights. Use the checkbox in the bottom-right to also see the most negative weights (may be slow).
Below that are examples of words for which the filter emits the highest values. A filter's response to a word is its maximum value over all width-3 substrings of that word. So if a filter has high weights on 'c' in the first position, 'a' in the second, and 't' in the third, it will assign equally high scores to 'cat', 'fatcat', 'concatenate', etc. The portion of the string in blue is the substring the filter is responding to. Where the highlighting is not visible, the substring is the three characters immediately before the space inside each example word.
'^' and '$' represent beginning and end of word markers, respectively. '_' is a padding character. Literal versions of those characters are escaped with a backslash.
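The scoring rule described above can be sketched in a few lines of Python. This is a simplified illustration, not the lm_1b implementation: it assumes each filter position carries one scalar weight per character (the real model convolves over character embeddings), and the names `filter_response`, `cat_filter`, and `pad_to`, as well as the weight and bias values, are hypothetical.

```python
def filter_response(word, weights, bias=0.0, pad_to=12):
    """Return (score, substring) for the best-scoring window of a word.

    weights: one dict per filter position, mapping character -> weight
             (a simplifying assumption; unlisted characters weigh 0.0).
    pad_to:  hypothetical padded length; '_' is the padding character.
    """
    chars = "^" + word + "$"                    # begin/end-of-word markers
    chars += "_" * max(0, pad_to - len(chars))  # right-pad with '_'
    width = len(weights)
    best_score, best_start = float("-inf"), -1
    for start in range(len(chars) - width + 1):
        window = chars[start:start + width]
        score = bias + sum(pos.get(ch, 0.0) for pos, ch in zip(weights, window))
        if score > best_score:                  # keep the first maximum
            best_score, best_start = score, start
    return best_score, chars[best_start:best_start + width]


# A toy width-3 filter that likes 'c', then 'a', then 't'.
cat_filter = [{"c": 1.0}, {"a": 1.0}, {"t": 1.0}]
for word in ("cat", "fatcat", "concatenate"):
    print(word, filter_response(word, cat_filter, bias=-0.5))
```

All three words receive the same score because only the best window counts, regardless of where in the word it occurs.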
Use the links at the top to see filters of other widths.
Check out my blog post here for a bit more context.
Filter 0 (bias = -0.62)
/
k
O
v
L
t
G
q
<BOS>
b
o
B
Q
g
Y
w
K
i
\194
h
"
e
X
a
^
F
8
p
S
H
<EOS>
n
5
T
6
r
l
u
I
.
p
?
J
(
P
y
z
H
v
\$
B
F
b
h
K
!
\195
Q
0
u
l
s
o
M
%
"
;
t
k
S
x
/
i
:
9
A
G
&
R
t
b
I
x
\194
J
Q
f
/
G
T
9
$
Z
A
p
l
K
D
\195
,
3
S
8
O
v
C
w
B
P
g
-
y
E
Non-zero for 14.5% of words.
^Out $____ (1.8) ^Out side$____ (1.8) ^Lut her$____ (1.7) ^Lut on$____ (1.7) ^Ott awa$____ (1.7) ^about $____ (1.7) ^out $____ (1.7) ^without $____ (1.7) ^Sout h$____ (1.7) ^most $____ (1.6) ^lost $____ (1.6) ^almost $____ (1.6) ^cost $____ (1.6) ^cost s$____ (1.6) ^ht tp$____ (1.6) ^Scott ish$____ (1.5) ^Scott $____ (1.5) ^bott om$____ (1.5) ^spott ed$____ (1.5) ^ut ility$____ (1.4) ^ut ilities$____ (1.4) ^ut terly$____ (1.4) ^ut ter$____ (1.4) ^L.$ ____ (1.3) ^N.F.L.$ ____ (1.3) ^Sut ton$____ (1.3) ^Sut herland$____ (1.3) ^L.A .$____ (1.3) ^st ill$____ (1.3) ^st ate$____ (1.3) ^st art$____ (1.3) ^st atement$____ (1.3) ^st udy$____ (1.3) ^st arted$____ (1.3) ^G.$ ____ (1.3) ^No.$ ____ (1.3) ^Co.$ ____ (1.3) ^Q.$ ____ (1.3) ^N.Y.$ ____ (1.3) ^D-N.Y.$ ____ (1.3)
Filter 1 (bias = -0.39)
f
u
K
D
w
T
W
d
N
v
<BOS>
l
x
j
Q
-
S
R
,
0
O
Y
y
g
F
h
e
J
X
H
B
1
3
C
5
z
2
\194
Z
t
U
f
u
F
Y
e
H
x
R
j
&
p
1
S
z
\$
T
c
Z
t
\195
.
A
r
)
N
a
n
i
w
/
g
L
\163
:
5
\194
o
;
E
9
m
x
g
8
t
B
y
W
p
Q
-
\194
j
N
i
7
P
6
r
"
T
q
l
X
O
v
D
4
d
5
s
b
G
3
M
a
f
2
U
Non-zero for 17.1% of words.
^Fairfax $____ (2.0) ^Halifax $____ (2.0) ^fix ed$____ (2.0) ^fix $____ (2.0) ^fix ture$____ (2.0) ^fix ing$____ (2.0) ^Vaux hall$____ (2.0) ^Bordeaux $____ (2.0) ^KAB UL$____ (1.9) ^UB S$____ (1.8) ^Wax man$____ (1.7) ^Wu$ ____ (1.7) ^fun ds$____ (1.6) ^fun d$____ (1.6) ^fun ding$____ (1.6) ^fun $____ (1.6) ^UN $____ (1.6) ^Nix on$____ (1.6) ^SUV $____ (1.6) ^NY$ ____ (1.5) ^Sub s$____ (1.5) ^fav orite$____ (1.5) ^fav or$____ (1.5) ^fav our$____ (1.5) ^fav ourite$____ (1.5) ^fiv e$____ (1.5) ^fiv e-year$____ (1.5) ^fiv e-day$____ (1.5) ^sexua l$____ (1.5) ^sexua lly$____ (1.5) ^homosexua lity$____ (1.5) ^sexua lity$____ (1.5) ^homosexua l$____ (1.5) ^Qua lity$____ (1.5) ^LSU$ ____ (1.5) ^swun g$____ (1.5) ^Lux embourg$____ (1.4) ^YOU$ ____ (1.4) ^ACORN $____ (1.4) ^19 $____ (1.4)
Filter 2 (bias = -0.45)
<BOS>
k
e
u
X
U
l
B
L
h
j
v
D
C
O
R
5
i
-
b
6
T
d
Y
2
a
E
s
J
c
7
'
/
H
Q
V
.
m
N
A
Q
v
X
x
r
c
M
p
Y
s
H
t
:
i
Z
a
?
z
;
d
/
w
!
0
"
k
(
f
&
n
L
C
R
g
N
h
m
5
\195
B
Q
m
"
w
Y
f
X
-
\194
v
$
p
7
s
8
a
R
z
9
d
V
u
W
g
S
i
U
n
x
e
.
y
l
Non-zero for 13.5% of words.
^after$ ____ (1.7) ^her$ ____ (1.7) ^over$ ____ (1.7) ^other$ ____ (1.7) ^al-Q aida$____ (1.6) ^al-Q aeda$____ (1.6) ^Al-Q aeda$____ (1.6) ^Dr$ ____ (1.5) ^FOX$ ____ (1.4) ^Or$ ____ (1.4) ^Sadr$ ____ (1.3) ^al-Sadr$ ____ (1.3) ^JERUSALEM$ ____ (1.2) ^Jr$ ____ (1.2) ^Q$ ____ (1.1) ^ALL$ ____ (1.1) ^them$ ____ (1.1) ^system$ ____ (1.1) ^problem$ ____ (1.1) ^seem$ ____ (1.1) ^Jerusalem$ ____ (1.1) ^stem$ ____ (1.1) ^System$ ____ (1.1) ^item$ ____ (1.1) ^NASDAQ $____ (1.1) ^GM$ ____ (1.0) ^MGM$ ____ (1.0) ^film$ ____ (1.0) ^calm$ ____ (1.0) ^Film$ ____ (1.0) ^Palm$ ____ (1.0) ^Malcolm$ ____ (1.0) ^palm$ ____ (1.0) ^helm$ ____ (1.0) ^NY$ ____ (1.0) ^AOL$ ____ (1.0) ^X$ ____ (0.9) ^FOXN ews.com$____ (0.9) ^FOR$ ____ (0.9) ^M$ ____ (0.9)
Filter 3 (bias = -0.50)
<BOS>
c
X
d
E
v
O
x
W
u
K
C
N
.
S
-
3
p
H
'
M
s
Q
n
J
k
4
g
2
y
5
z
Y
m
I
a
e
U
^
h
W
u
Q
R
X
j
"
D
!
H
x
n
:
h
p
C
K
J
w
U
a
r
b
-
\194
l
q
o
2
L
;
g
\163
T
?
M
6
)
V
i
-
x
D
k
d
f
J
S
u
B
1
h
Z
b
2
t
I
A
0
p
P
F
7
W
6
a
j
m
\195
,
3
K
G
'
9
y
R
r
L
c
Non-zero for 15.7% of words.
^Wu $____ (1.8) ^WI TH$____ (1.5) ^ex- wife$____ (1.5) ^Qu een$____ (1.4) ^Qu ote$____ (1.4) ^Qu eens$____ (1.4) ^Qu inn$____ (1.4) ^Wad e$____ (1.3) ^NEW$ ____ (1.3) ^32- year-old$____ (1.3) ^MOSCOW$ ____ (1.3) ^Nad al$____ (1.3) ^36- year-old$____ (1.3) ^NW$ ____ (1.3) ^22- year-old$____ (1.2) ^Sad dam$____ (1.2) ^Sad r$____ (1.2) ^al-Sad r$____ (1.2) ^Sad ly$____ (1.2) ^Had $____ (1.2) ^38- year-old$____ (1.2) ^Mad rid$____ (1.2) ^Mad off$____ (1.2) ^Mad onna$____ (1.2) ^Mad ison$____ (1.2) ^WH O$____ (1.2) ^BMW$ ____ (1.2) ^six- month$____ (1.2) ^six- year$____ (1.2) ^six- party$____ (1.2) ^26- year-old$____ (1.2) ^28- year-old$____ (1.1) ^2- 1$____ (1.1) ^2- 0$____ (1.1) ^2- 2$____ (1.1) ^2- 3$____ (1.1) ^Wed nesday$____ (1.1) ^Spu rs$____ (1.1) ^35- year-old$____ (1.1) ^hip- hop$____ (1.1)
Filter 4 (bias = -0.45)
N
-
B
g
F
p
,
G
A
j
U
J
S
v
L
d
K
i
Q
m
f
0
a
P
/
z
W
V
8
b
t
c
"
w
s
\195
\194
T
y
l
Y
f
G
.
i
F
O
x
g
n
z
c
V
N
W
v
X
?
E
'
1
y
T
d
H
\$
A
-
2
u
4
r
3
e
J
q
I
M
&
m
W
c
H
z
Q
v
X
C
E
o
4
D
"
l
2
m
3
s
q
n
I
d
Y
f
7
x
N
.
$
p
6
'
e
-
0
g
t
Non-zero for 13.6% of words.
^BAGH DAD$____ (2.2) ^ANGE LES$____ (2.2) ^NEW $____ (2.1) ^NY$ ____ (2.1) ^BEIJING$ ____ (1.8) ^NYS E$____ (1.8) ^DENVE R$____ (1.7) ^Nie lsen$____ (1.7) ^FOX $____ (1.6) ^FOX News.com$____ (1.6) ^UAW $____ (1.6) ^BAE $____ (1.5) ^WASH INGTON$____ (1.5) ^Wi-Fi$ ____ (1.4) ^AG$ ____ (1.4) ^Fie ld$____ (1.4) ^Fie lds$____ (1.4) ^NO$ ____ (1.4) ^Bir mingham$____ (1.4) ^Bir d$____ (1.4) ^NW$ ____ (1.3) ^HBO$ ____ (1.3) ^TOKYO $____ (1.3) ^CHICAGO $____ (1.3) ^H1N1$ ____ (1.3) ^Bib le$____ (1.2) ^DNA$ ____ (1.2) ^LG$ ____ (1.2) ^BT$ ____ (1.2) ^NBA$ ____ (1.2) ^BA$ ____ (1.2) ^FBI$ ____ (1.1) ^RBI$ ____ (1.1) ^CBI$ ____ (1.1) ^UAE $____ (1.1) ^Fir st$____ (1.1) ^PRNewswire-Fir stCall$____ (1.1) ^Fir e$____ (1.1) ^Fir efighters$____ (1.1) ^Li$ ____ (1.1)
Filter 5 (bias = -0.58)
X
-
Z
f
Q
x
K
k
D
p
/
s
8
v
L
i
M
j
W
b
"
g
^
h
6
'
2
l
N
t
1
u
Y
r
G
n
7
d
3
m
R
X
u
v
r
6
s
e
U
0
-
5
'
B
?
p
&
q
k
W
(
T
Q
D
"
2
\195
x
C
K
!
l
o
4
Y
L
O
j
:
F
F
h
Q
y
P
q
j
n
X
u
E
w
J
o
7
a
L
i
D
x
S
v
6
k
5
-
G
.
8
g
K
p
9
t
V
H
Z
c
2
W
Non-zero for 11.6% of words.
^DUP $____ (1.9) ^MRS A$____ (1.9) ^AZUZ $____ (1.7) ^Que en$____ (1.7) ^Que ens$____ (1.7) ^Que stion$____ (1.7) ^Que ensland$____ (1.7) ^TARP $____ (1.6) ^UP I$____ (1.6) ^YORK $____ (1.5) ^Due $____ (1.5) ^MRI $____ (1.5) ^Kre mlin$____ (1.5) ^Dre w$____ (1.5) ^Dre am$____ (1.5) ^Re uters$____ (1.4) ^Re publican$____ (1.4) ^Re publicans$____ (1.4) ^Re search$____ (1.4) ^Kuz netsova$____ (1.4) ^UE FA$____ (1.4) ^Dr$ ____ (1.3) ^DVDs$ ____ (1.3) ^CDs$ ____ (1.3) ^R$ ____ (1.3) ^US $____ (1.3) ^US A$____ (1.3) ^US C$____ (1.3) ^US S$____ (1.3) ^Dus tin$____ (1.3) ^UK $____ (1.2) ^Mue ller$____ (1.2) ^6-7 $____ (1.2) ^Lul a$____ (1.2) ^HMRC $____ (1.2) ^Mul len$____ (1.2) ^Mul lah$____ (1.2) ^SEOUL $____ (1.1) ^TORO NTO$____ (1.1) ^FOR$ ____ (1.1)
Filter 6 (bias = -0.57)
R
m
9
y
Q
p
<BOS>
g
U
l
B
i
8
o
k
h
"
t
C
-
7
f
I
O
Z
e
\194
M
V
j
N
w
b
L
v
S
u
.
z
G
f
h
m
g
P
E
K
u
;
.
M
A
p
c
V
a
X
?
'
T
/
q
J
v
%
D
,
\163
i
d
n
(
l
t
6
0
Z
U
:
r
g
m
j
U
4
P
7
z
F
f
S
p
E
o
5
T
h
K
8
B
e
t
Q
u
V
k
3
y
2
i
6
a
X
l
Z
,
9
\195
0
v
Non-zero for 19.8% of words.
^Rig hts$____ (1.8) ^Rig ht$____ (1.8) ^TARP$ ____ (1.6) ^YORK$ ____ (1.5) ^Rih anna$____ (1.4) ^1964 $____ (1.4) ^Uig hurs$____ (1.3) ^Uig hur$____ (1.3) ^1967 $____ (1.3) ^Big $____ (1.3) ^Reg ional$____ (1.2) ^Reg iment$____ (1.2) ^Reg ion$____ (1.2) ^Reg ardless$____ (1.2) ^Rog er$____ (1.2) ^Rog ers$____ (1.2) ^Afg hanistan$____ (1.2) ^Afg han$____ (1.2) ^Afg hans$____ (1.2) ^safe ty$____ (1.2) ^safe $____ (1.2) ^Safe ty$____ (1.2) ^safe ly$____ (1.2) ^safe r$____ (1.2) ^RBS $____ (1.2) ^1954 $____ (1.2) ^upg rade$____ (1.2) ^upg raded$____ (1.2) ^upg rades$____ (1.2) ^fe w$____ (1.2) ^fe deral$____ (1.2) ^fe ll$____ (1.2) ^fe el$____ (1.2) ^fe et$____ (1.2) ^1965 $____ (1.2) ^consume rs$____ (1.2) ^consume r$____ (1.2) ^docume nts$____ (1.2) ^argume nt$____ (1.2) ^IMF $____ (1.2)
Filter 7 (bias = -0.60)
B
d
w
p
W
-
K
g
<BOS>
l
M
P
N
j
X
C
Z
D
q
r
E
s
b
h
v
G
V
R
2
u
Q
y
"
'
9
1
m
7
<EOS>
c
?
p
.
P
W
F
(
f
q
D
H
j
!
d
:
G
w
c
Y
%
u
S
Q
s
-
C
N
0
"
l
&
K
Z
z
X
L
\194
)
5
g
f
V
y
X
U
0
F
G
r
7
N
Y
u
1
s
Z
o
6
B
5
t
j
a
2
,
i
x
4
K
J
.
I
L
\194
m
C
O
w
S
Non-zero for 9.9% of words.
^Bag hdad$____ (2.0) ^wag es$____ (1.9) ^wag e$____ (1.9) ^Volkswag en$____ (1.9) ^sewag e$____ (1.9) ^B.$ ____ (1.9) ^downg raded$____ (1.9) ^downg rade$____ (1.9) ^Wag ner$____ (1.8) ^Bui lding$____ (1.7) ^Mug abe$____ (1.7) ^W.$ ____ (1.7) ^low-i ncome$____ (1.7) ^Nug gets$____ (1.6) ^N.Y .$____ (1.5) ^D-N.Y .$____ (1.5) ^BAG HDAD$____ (1.5) ^Buz z$____ (1.4) ^swun g$____ (1.4) ^Bul ls$____ (1.4) ^Bul l$____ (1.4) ^Bul garia$____ (1.4) ^Bul ldogs$____ (1.4) ^Bul lock$____ (1.4) ^N.J .$____ (1.4) ^Bai ley$____ (1.4) ^Big $____ (1.4) ^U.K.$ ____ (1.4) ^K.$ ____ (1.4) ^whi ch$____ (1.3) ^whi le$____ (1.3) ^whi te$____ (1.3) ^Meanwhi le$____ (1.3) ^meanwhi le$____ (1.3) ^Wu$ ____ (1.3) ^wai ting$____ (1.3) ^wai t$____ (1.3) ^Hawai i$____ (1.3) ^wai ted$____ (1.3) ^Mag ic$____ (1.3)
Filter 8 (bias = -0.66)
<BOS>
T
Q
D
s
y
-
p
w
h
W
P
"
d
S
H
'
c
^
B
<EOS>
L
\194
F
V
m
E
t
q
r
l
f
k
1
U
p
u
g
Q
j
Z
-
B
i
L
S
&
w
D
x
"
o
T
e
R
f
/
O
)
5
\194
l
Y
E
H
h
X
n
8
t
!
s
?
G
-
A
'
h
V
t
f
T
P
H
b
y
R
D
J
a
x
L
9
1
s
q
v
W
Q
N
j
O
k
E
\194
c
Z
o
"
g
p
B
7
i
Non-zero for 11.0% of words.
^SUV $____ (2.4) ^UP I$____ (2.1) ^suf fered$____ (2.1) ^suf fering$____ (2.1) ^suf fer$____ (2.1) ^suf ficient$____ (2.1) ^sub ject$____ (2.0) ^sub stantial$____ (2.0) ^sub mitted$____ (2.0) ^sub sidiary$____ (2.0) ^sub prime$____ (2.0) ^sub sequent$____ (2.0) ^sub sidies$____ (2.0) ^sub stance$____ (2.0) ^D- Calif$____ (2.0) ^D- N.Y.$____ (2.0) ^T- shirts$____ (1.9) ^T- shirt$____ (1.9) ^T- Mobile$____ (1.9) ^Us ing$____ (1.8) ^Us $____ (1.8) ^Us e$____ (1.8) ^Us ers$____ (1.8) ^Us ually$____ (1.8) ^sus pected$____ (1.7) ^sus pect$____ (1.7) ^sus pended$____ (1.7) ^sus pects$____ (1.7) ^Suf folk$____ (1.7) ^U.S.- led$____ (1.6) ^U.S.- backed$____ (1.6) ^9- year-old$____ (1.6) ^Uk raine$____ (1.6) ^Uk rainian$____ (1.6) ^SOUR CE$____ (1.6) ^Sub s$____ (1.5) ^ub iquitous$____ (1.5) ^LSU$ ____ (1.4) ^Up $____ (1.4) ^Up per$____ (1.4)
Filter 9 (bias = -0.56)
y
-
C
w
F
I
c
E
P
<BOS>
Y
J
h
j
"
i
k
t
8
2
K
.
m
e
p
u
Q
n
G
s
V
g
L
o
B
z
T
v
b
a
-
F
J
S
X
t
\195
c
Y
y
9
s
6
f
1
k
o
A
w
C
Z
h
0
\$
:
U
3
r
2
x
7
\163
n
,
u
e
i
.
l
E
Y
-
h
w
Q
z
S
d
"
f
V
P
\194
m
W
J
4
e
7
D
H
E
8
U
$
\195
x
p
C
u
k
o
X
I
9
l
K
v
Non-zero for 9.5% of words.
^hip-h op$____ (3.6) ^career-h igh$____ (3.4) ^half-h our$____ (3.3) ^24-h our$____ (2.9) ^anywh ere$____ (2.9) ^everywh ere$____ (2.9) ^second-h alf$____ (2.8) ^two-h our$____ (2.6) ^Coh en$____ (2.5) ^alcoh ol$____ (2.5) ^alcoh olic$____ (2.5) ^Alcoh ol$____ (2.5) ^season-h igh$____ (2.5) ^in-h ouse$____ (2.5) ^first-h alf$____ (2.4) ^right-h ander$____ (2.4) ^left-h ander$____ (2.4) ^worthwh ile$____ (2.3) ^D-C alif$____ (2.2) ^Loh an$____ (2.1) ^6-4 $____ (2.1) ^Doh a$____ (2.1) ^6-7 $____ (2.1) ^Secretary-G eneral$____ (2.1) ^Moh ammed$____ (2.0) ^Moh amed$____ (2.0) ^Moh ammad$____ (2.0) ^al-Q aida$____ (2.0) ^al-Q aeda$____ (2.0) ^Al-Q aeda$____ (2.0) ^DJ$ ____ (2.0) ^proh ibited$____ (2.0) ^proh ibit$____ (2.0) ^proh ibits$____ (2.0) ^fourth-q uarter$____ (2.0) ^high-q uality$____ (2.0) ^overwh elming$____ (2.0) ^overwh elmingly$____ (2.0) ^overwh elmed$____ (2.0) ^All-S tar$____ (2.0)
Filter 10 (bias = -0.44)
H
x
U
v
L
f
Z
c
P
b
1
o
D
w
/
S
y
g
X
.
3
-
6
k
Q
t
7
'
2
e
I
s
C
p
8
<BOS>
,
j
^
h
b
t
Y
d
V
I
B
n
G
y
J
-
X
D
x
p
9
C
K
,
Z
\$
"
c
)
.
8
f
L
u
M
s
6
e
%
r
m
i
0
H
'
y
\194
p
t
h
j
a
-
K
.
P
C
x
$
L
s
1
Q
3
I
H
V
f
R
B
S
8
z
r
/
q
v
i
b
e
2
Non-zero for 15.3% of words.
^SUV$ ____ (2.2) ^doubt $____ (2.0) ^doubt s$____ (2.0) ^subt le$____ (2.0) ^undoubt edly$____ (2.0) ^subj ect$____ (1.9) ^subj ects$____ (1.9) ^subj ected$____ (1.9) ^sub- prime$____ (1.9) ^1bn $____ (1.9) ^b$ ____ (1.7) ^club$ ____ (1.7) ^Club$ ____ (1.7) ^pub$ ____ (1.7) ^hub$ ____ (1.7) ^Y$ ____ (1.7) ^clubs $____ (1.7) ^subs tantial$____ (1.7) ^subs idiary$____ (1.7) ^subs equent$____ (1.7) ^subs idies$____ (1.7) ^subs tance$____ (1.7) ^subs equently$____ (1.7) ^subs titute$____ (1.7) ^NHL$ ____ (1.7) ^19t h$____ (1.7) ^19t h-century$____ (1.7) ^LG$ ____ (1.7) ^UBS $____ (1.6) ^19- year-old$____ (1.6) ^UK$ ____ (1.6) ^XVI $____ (1.6) ^PBS $____ (1.5) ^AZUZ$ ____ (1.5) ^V. $____ (1.5) ^HIV$ ____ (1.5) ^IV$ ____ (1.5) ^DJ$ ____ (1.5) ^HMR C$____ (1.5) ^18t h$____ (1.4)
Filter 11 (bias = -0.48)
A
-
/
v
,
d
B
g
I
u
C
p
Q
b
N
x
H
j
W
r
5
G
X
J
t
e
K
f
n
m
Z
E
^
o
4
P
2
\163
'
X
s
Q
o
H
m
8
z
7
f
2
l
6
w
1
'
"
g
q
-
D
x
9
t
3
S
4
n
W
%
P
k
e
i
?
c
Z
.
E
A
G
F
z
f
w
h
g
u
Z
t
m
x
K
k
L
d
O
q
X
,
A
v
c
R
Y
H
V
B
.
N
U
j
E
\194
M
I
\195
9
/
r
Non-zero for 10.4% of words.
^ANG ELES$____ (1.9) ^AIG $____ (1.8) ^DIEG O$____ (1.3) ^WASHING TON$____ (1.3) ^BEIJING $____ (1.3) ^Arg entina$____ (1.3) ^Arg entine$____ (1.3) ^Adm inistration$____ (1.2) ^Adm iral$____ (1.2) ^Adm .$____ (1.2) ^3G $____ (1.2) ^Aug ust$____ (1.2) ^Aug $____ (1.2) ^Aug usta$____ (1.2) ^PG A$____ (1.2) ^LPG A$____ (1.1) ^Hez bollah$____ (1.0) ^DAX$ ____ (1.0) ^Arm y$____ (1.0) ^Arm strong$____ (1.0) ^Arm ed$____ (1.0) ^Arm enia$____ (1.0) ^Arm s$____ (1.0) ^New $____ (1.0) ^New s$____ (1.0) ^PRNew swire$____ (1.0) ^PRNew swire-FirstCall$____ (1.0) ^Hew itt$____ (1.0) ^ATL ANTA$____ (1.0) ^VEG AS$____ (1.0) ^Benitez $____ (0.9) ^Ben\195\173tez $____ (0.9) ^Dw ight$____ (0.9) ^NATO $____ (0.9) ^Venez uela$____ (0.9) ^Venez uelan$____ (0.9) ^Martinez $____ (0.9) ^PHILADE LPHIA$____ (0.9) ^NASDAQ$ ____ (0.9) ^MG M$____ (0.9)
Filter 12 (bias = -0.51)
j
a
V
y
<BOS>
o
4
u
5
.
7
U
S
T
Z
r
6
h
M
m
F
q
3
b
2
d
X
L
C
A
I
x
J
v
9
z
Q
l
^
\195
Q
g
X
d
N
p
Z
h
/
i
B
s
!
G
?
E
(
j
.
S
&
x
K
y
:
-
\194
u
q
%
M
1
n
e
W
0
w
c
V
o
d
w
D
B
1
b
7
f
C
k
P
K
H
m
6
o
8
N
I
M
0
x
2
.
4
E
L
v
p
W
5
r
\194
S
l
z
j
t
X
s
Non-zero for 9.9% of words.
^Ahmadinejad $____ (2.0) ^2nd $____ (1.8) ^22nd $____ (1.8) ^4.1 $____ (1.7) ^5.1 $____ (1.7) ^Ind ia$____ (1.6) ^Ind ian$____ (1.6) ^Ind iana$____ (1.6) ^Ind onesia$____ (1.6) ^4.7 $____ (1.5) ^5.7 $____ (1.5) ^Md $____ (1.5) ^jud ge$____ (1.4) ^jud ges$____ (1.4) ^jud gment$____ (1.4) ^jud icial$____ (1.4) ^6.1 $____ (1.4) ^find $____ (1.4) ^ind ustry$____ (1.4) ^behind $____ (1.4) ^kind $____ (1.4) ^ind ependent$____ (1.4) ^ind ex$____ (1.4) ^mind $____ (1.4) ^3rd $____ (1.4) ^23rd $____ (1.4) ^3.1 $____ (1.4) ^2.1 $____ (1.4) ^Sad dam$____ (1.4) ^Sad r$____ (1.4) ^al-Sad r$____ (1.4) ^Sad ly$____ (1.4) ^end $____ (1.4) ^spend ing$____ (1.4) ^end ed$____ (1.4) ^weekend $____ (1.4) ^HSBC $____ (1.4) ^SNP $____ (1.3) ^QC $____ (1.3) ^'d $____ (1.3)
Filter 13 (bias = -0.48)
<BOS>
y
J
p
j
c
X
f
l
x
-
h
\194
a
I
o
V
s
Q
,
Y
k
7
C
6
r
0
F
^
n
Z
U
R
i
\195
O
9
A
M
K
R
p
Z
x
U
f
u
v
Q
l
?
o
(
i
&
d
H
t
r
m
Y
e
)
,
M
y
"
a
V
q
A
W
N
h
!
c
C
K
E
0
k
d
w
-
B
D
K
u
z
h
V
l
W
j
A
L
U
y
b
.
G
1
Z
x
Y
F
E
7
S
H
Q
f
T
e
O
n
X
6
c
P
Non-zero for 13.5% of words.
^Demjanjuk $____ (2.9) ^Rw anda$____ (2.4) ^RB S$____ (2.4) ^RB I$____ (2.4) ^RB Is$____ (2.4) ^Uk raine$____ (2.2) ^Uk rainian$____ (2.2) ^UB S$____ (2.1) ^Muk asey$____ (2.1) ^Suzuk i$____ (2.1) ^Duk e$____ (2.1) ^Luk e$____ (2.0) ^HB OS$____ (1.9) ^HB O$____ (1.9) ^Jak e$____ (1.8) ^Jak arta$____ (1.8) ^IRA $____ (1.8) ^UK $____ (1.8) ^well-k nown$____ (1.7) ^Jew ish$____ (1.7) ^Jew s$____ (1.7) ^club $____ (1.7) ^Club $____ (1.7) ^club s$____ (1.7) ^nightclub $____ (1.7) ^Nk unda$____ (1.7) ^guardian.co.uk $____ (1.7) ^Aw ards$____ (1.6) ^Aw ard$____ (1.6) ^Aw akening$____ (1.6) ^Merk el$____ (1.6) ^Berk shire$____ (1.6) ^Berk eley$____ (1.6) ^clerk $____ (1.6) ^jaw $____ (1.6) ^RA F$____ (1.6) ^AB C$____ (1.6) ^NB C$____ (1.6) ^NB A$____ (1.6) ^Ik e$____ (1.5)
Filter 14 (bias = -0.51)
B
-
K
g
C
r
Z
d
c
u
V
h
k
j
U
E
9
H
v
O
F
l
n
e
z
i
f
o
5
.
8
p
M
y
,
\163
6
I
x
Y
M
x
Z
p
m
a
-
h
?
d
/
v
u
S
K
A
R
t
J
q
'
c
Y
E
&
\163
!
0
X
F
N
s
(
i
H
k
r
g
Q
l
Q
j
W
o
X
f
"
J
q
l
a
-
Z
t
.
D
b
i
$
s
\194
S
V
e
8
p
Y
O
P
T
E
M
g
F
Non-zero for 8.4% of words.
^BMW $____ (2.5) ^IBM$ ____ (2.1) ^Bra zil$____ (1.3) ^Bra zilian$____ (1.3) ^Bra d$____ (1.3) ^Bra dy$____ (1.3) ^Boa rd$____ (1.3) ^Bob $____ (1.2) ^Bob by$____ (1.2) ^By$ ____ (1.1) ^B.$ ____ (1.1) ^HBO$ ____ (1.1) ^Kra ft$____ (1.0) ^BP$ ____ (1.0) ^PKK$ ____ (1.0) ^Uma r$____ (0.9) ^Buy $____ (0.9) ^'Ma lley$____ (0.9) ^AZUZ$ ____ (0.9) ^Bea ch$____ (0.9) ^Bea r$____ (0.9) ^Bea tles$____ (0.9) ^Bea rs$____ (0.9) ^km$ ____ (0.8) ^York-b ased$____ (0.8) ^Ma rch$____ (0.8) ^Ma y$____ (0.8) ^Ma ny$____ (0.8) ^Ma rk$____ (0.8) ^Myanma r$____ (0.8) ^inma tes$____ (0.8) ^Denma rk$____ (0.8) ^gunma n$____ (0.8) ^inma te$____ (0.8) ^WAM$ ____ (0.8) ^AM$ ____ (0.8) ^Box $____ (0.8) ^Box ing$____ (0.8) ^BBC$ ____ (0.8) ^ABC$ ____ (0.8)
Filter 15 (bias = -0.53)
.
J
'
B
s
K
Q
e
C
E
g
P
\194
f
/
T
0
N
9
o
3
\195
M
D
2
v
j
F
G
n
Y
f
X
.
O
a
M
x
K
t
E
q
"
-
T
d
3
F
Z
v
0
A
J
u
V
l
Q
k
8
'
W
s
)
h
6
\$
4
p
x
Z
W
M
h
F
v
P
a
J
\194
j
p
D
o
L
Y
r
S
K
i
n
"
U
t
e
q
N
'
\195
d
R
O
H
l
C
s
f
$
3
Non-zero for 11.5% of words.
^Gh ana$____ (2.4) ^Ga za$____ (2.2) ^Ga mes$____ (2.2) ^Ga tes$____ (2.2) ^Ga ry$____ (2.2) ^Go vernment$____ (2.1) ^Go ogle$____ (2.1) ^Go rdon$____ (2.1) ^Go v.$____ (2.1) ^Gi ants$____ (2.0) ^Gi ven$____ (2.0) ^Gi uliani$____ (2.0) ^Gi bbs$____ (2.0) ^Gi bson$____ (2.0) ^GO P$____ (1.8) ^Gl obal$____ (1.8) ^Gl asgow$____ (1.8) ^Gl obe$____ (1.8) ^Gl enn$____ (1.8) ^CHICAGO $____ (1.8) ^Ya hoo$____ (1.8) ^Ya nkees$____ (1.8) ^Ya ng$____ (1.8) ^Ya le$____ (1.8) ^Ox ford$____ (1.8) ^Ox fordshire$____ (1.8) ^G$ ____ (1.8) ^G. $____ (1.8) ^G. M.$____ (1.8) ^AG$ ____ (1.7) ^N.Y. $____ (1.7) ^D-N.Y. $____ (1.7) ^Yo rk$____ (1.7) ^Yo u$____ (1.7) ^Yo ung$____ (1.7) ^Yo ur$____ (1.7) ^MOSCOW $____ (1.7) ^AIG$ ____ (1.5) ^Ex change$____ (1.5) ^Ex ecutive$____ (1.5)
Filter 16 (bias = -0.58)
w
y
z
.
I
h
i
r
J
F
2
m
0
u
5
L
E
f
B
'
<BOS>
M
W
d
v
"
K
H
3
c
6
D
9
Q
4
b
V
N
1
x
Q
f
"
%
W
L
\194
l
Y
m
(
J
T
x
X
-
&
P
C
n
b
a
s
\195
p
d
e
F
U
o
Z
h
G
x
X
u
w
H
z
f
M
a
K
A
2
k
Q
r
V
t
E
y
0
l
J
i
c
,
5
o
$
F
6
q
-
B
g
S
D
R
Non-zero for 10.1% of words.
^awkw ard$____ (2.0) ^ITV $____ (1.6) ^Switz erland$____ (1.5) ^Spitz er$____ (1.5) ^Fritz l$____ (1.5) ^Fitz gerald$____ (1.5) ^ICE $____ (1.5) ^IT$ ____ (1.4) ^DETROIT$ ____ (1.4) ^CIT$ ____ (1.4) ^EBITD A$____ (1.4) ^WASHING TON$____ (1.3) ^BEIJING $____ (1.3) ^FDIC$ ____ (1.3) ^NEW$ ____ (1.2) ^DIEG O$____ (1.2) ^HIV$ ____ (1.2) ^IV$ ____ (1.2) ^PARIS$ ____ (1.2) ^IS$ ____ (1.2) ^Malik$ ____ (1.1) ^Erik$ ____ (1.1) ^Henrik$ ____ (1.1) ^Ike $____ (1.1) ^It$ ____ (1.1) ^pitc h$____ (1.1) ^kitc hen$____ (1.1) ^Mitc hell$____ (1.1) ^switc h$____ (1.1) ^like $____ (1.0) ^like ly$____ (1.0) ^strike $____ (1.0) ^Mike $____ (1.0) ^unlike ly$____ (1.0) ^it$ ____ (1.0) ^hit$ ____ (1.0) ^credit$ ____ (1.0) ^visit$ ____ (1.0) ^public$ ____ (1.0) ^economic$ ____ (1.0)
Filter 17 (bias = -0.48)
R
.
C
g
P
h
Q
m
9
l
8
w
U
e
"
A
7
b
3
t
,
x
I
q
1
L
6
-
\194
v
^
E
<BOS>
j
Z
a
'
o
2
T
-
C
E
A
v
c
X
n
u
p
"
f
W
y
e
,
:
o
J
K
6
k
Q
r
\194
t
2
F
?
B
d
S
)
x
j
h
!
l
M
L
b
O
V
t
Y
f
.
P
Z
p
h
y
x
d
q
i
g
E
Q
U
X
,
9
D
7
s
\194
T
n
e
R
I
A
K
B
o
8
S
'
2
Non-zero for 12.4% of words.
^Rub in$____ (2.1) ^Reb ecca$____ (2.0) ^Cub a$____ (2.0) ^Cub an$____ (2.0) ^Cub s$____ (2.0) ^Cub ans$____ (2.0) ^Pub lic$____ (1.9) ^York-b ased$____ (1.5) ^cross-b order$____ (1.4) ^Rug by$____ (1.4) ^CCTV $____ (1.4) ^Reg ional$____ (1.3) ^Reg iment$____ (1.3) ^Reg ion$____ (1.3) ^Reg ardless$____ (1.3) ^London-b ased$____ (1.3) ^Washington-b ased$____ (1.3) ^would-b e$____ (1.3) ^record-b reaking$____ (1.3) ^Xb ox$____ (1.3) ^Run $____ (1.3) ^Run ning$____ (1.3) ^ub iquitous$____ (1.2) ^ICE$ ____ (1.2) ^SOURCE$ ____ (1.2) ^Ren ault$____ (1.2) ^Ren aissance$____ (1.2) ^Ren dell$____ (1.2) ^E. $____ (1.2) ^Cun ningham$____ (1.2) ^Cab inet$____ (1.2) ^Cab le$____ (1.2) ^Cab rera$____ (1.2) ^Cen ter$____ (1.1) ^Cen tral$____ (1.1) ^Cen tre$____ (1.1) ^Cen tury$____ (1.1) ^Pun jab$____ (1.1) ^Pab lo$____ (1.1) ^24-h our$____ (1.0)
Filter 18 (bias = -0.58)
i
.
4
c
3
z
j
d
H
D
J
U
5
'
F
r
6
C
S
G
V
v
f
y
B
a
h
T
e
u
2
Q
M
m
7
\163
W
-
9
g
M
p
v
I
)
P
"
l
b
i
Z
,
Y
d
V
t
9
n
\194
a
?
A
X
O
B
r
8
C
W
%
E
f
u
y
4
1
6
s
.
o
Q
f
Y
v
X
B
$
p
Z
P
"
x
g
k
V
a
G
t
7
F
\194
U
4
m
d
u
e
b
T
o
K
y
Non-zero for 13.3% of words.
^Aviv$ ____ (2.4) ^Lib$ ____ (2.0) ^IBM$ ____ (2.0) ^Medvedev$ ____ (1.8) ^Kiev$ ____ (1.8) ^JERUSALEM$ ____ (1.7) ^49$ ____ (1.7) ^1949$ ____ (1.7) ^BMW $____ (1.6) ^48$ ____ (1.6) ^1948$ ____ (1.6) ^Liu$ ____ (1.6) ^39$ ____ (1.5) ^1939$ ____ (1.5) ^Web$ ____ (1.5) ^Feb$ ____ (1.5) ^web$ ____ (1.5) ^HMR C$____ (1.4) ^38$ ____ (1.4) ^44$ ____ (1.4) ^1944$ ____ (1.4) ^46$ ____ (1.4) ^1946$ ____ (1.4) ^59$ ____ (1.4) ^1959$ ____ (1.4) ^PM$ ____ (1.4) ^six$ ____ (1.3) ^mix$ ____ (1.3) ^Six$ ____ (1.3) ^Phoenix$ ____ (1.3) ^fix$ ____ (1.3) ^Prix$ ____ (1.3) ^Felix$ ____ (1.3) ^Netflix$ ____ (1.3) ^public$ ____ (1.3) ^economic$ ____ (1.3) ^Democratic$ ____ (1.3) ^music$ ____ (1.3) ^40$ ____ (1.3) ^140$ ____ (1.3)
Filter 19 (bias = -0.51)
l
v
Y
u
O
F
o
k
S
e
L
f
A
E
G
q
/
d
z
\163
<BOS>
U
K
x
5
y
X
c
,
b
\194
a
<EOS>
w
j
9
J
8
^
"
N
d
Q
m
r
i
?
s
R
y
X
f
9
l
w
u
q
%
(
U
"
p
B
L
E
P
K
h
W
x
Z
F
b
,
&
D
Y
H
O
-
\194
-
Q
m
x
p
C
y
9
i
F
r
8
g
B
T
5
P
7
O
S
H
V
u
6
w
N
E
4
\195
"
e
v
o
$
G
'
d
c
J
Non-zero for 22.0% of words.
^FRANC ISCO$____ (1.9) ^ANC $____ (1.9) ^MSNB C$____ (1.8) ^complex $____ (1.7) ^Alex $____ (1.7) ^Alex ander$____ (1.7) ^flex ible$____ (1.7) ^NC AA$____ (1.5) ^WASHINGTON$ ____ (1.5) ^LONDON$ ____ (1.5) ^BOSTON$ ____ (1.5) ^ON$ ____ (1.5) ^HOUSTON$ ____ (1.5) ^NF L$____ (1.4) ^NF C$____ (1.4) ^FARC $____ (1.4) ^bulbs $____ (1.4) ^SAN$ ____ (1.4) ^TEHRAN$ ____ (1.4) ^NB C$____ (1.3) ^NB A$____ (1.3) ^lov e$____ (1.3) ^lov ed$____ (1.3) ^lov ely$____ (1.3) ^lov es$____ (1.3) ^QC $____ (1.3) ^solo$ ____ (1.3) ^Capello$ ____ (1.3) ^Buffalo$ ____ (1.3) ^Carlo$ ____ (1.3) ^Apollo$ ____ (1.3) ^Colo$ ____ (1.3) ^Paulo$ ____ (1.3) ^Pablo$ ____ (1.3) ^OEC D$____ (1.3) ^relax ed$____ (1.3) ^Galax y$____ (1.3) ^relax $____ (1.3) ^HSBC $____ (1.3) ^Col.$ ____ (1.3)
Filter 20 (bias = -0.46)
U
-
B
.
k
j
z
d
R
h
K
l
V
y
9
e
Q
g
Y
n
W
D
b
f
<BOS>
t
E
o
a
m
"
p
<EOS>
F
G
M
Z
L
2
c
W
j
K
F
O
u
X
h
y
s
p
g
;
v
Q
R
!
)
2
l
w
k
a
-
,
x
"
S
P
.
/
b
1
C
I
J
?
V
:
'
A
-
z
f
c
j
Q
u
W
J
.
P
a
i
w
h
X
r
K
x
t
p
/
R
B
m
T
F
N
M
I
'
U
d
Z
k
\194
s
2
e
Non-zero for 13.8% of words.
^UK$ ____ (1.6) ^Upt on$____ (1.6) ^WA SHINGTON$____ (1.6) ^WA M$____ (1.6) ^Bac k$____ (1.5) ^backya rd$____ (1.4) ^KA BUL$____ (1.4) ^Up$ ____ (1.3) ^HBO$ ____ (1.3) ^BAA $____ (1.2) ^Rya n$____ (1.2) ^Rya nair$____ (1.2) ^By$ ____ (1.2) ^Blackwa ter$____ (1.2) ^awkwa rd$____ (1.2) ^backwa rd$____ (1.2) ^backwa rds$____ (1.2) ^Kaz akhstan$____ (1.2) ^Bat h$____ (1.2) ^Bat talion$____ (1.2) ^Bat tle$____ (1.2) ^Bat man$____ (1.2) ^EPA $____ (1.1) ^breakaw ay$____ (1.1) ^VW$ ____ (1.1) ^Rac hel$____ (1.1) ^Rac e$____ (1.1) ^Rac ing$____ (1.1) ^UPI $____ (1.1) ^TORON TO$____ (1.1) ^YORK$ ____ (1.1) ^DETROI T$____ (1.1) ^Sarkoz y$____ (1.0) ^W. $____ (1.0) ^Kentucky$ ____ (1.0) ^lucky$ ____ (1.0) ^sky$ ____ (1.0) ^risky$ ____ (1.0) ^PKK$ ____ (1.0) ^USA $____ (1.0)
Filter 21 (bias = -0.22)
<BOS>
y
Y
f
J
p
b
n
X
c
B
F
E
d
z
C
Q
t
R
,
V
s
9
'
\195
h
W
x
\194
m
"
i
<EOS>
D
T
P
0
g
G
e
G
f
Y
d
g
x
V
F
Z
v
R
t
?
a
X
e
)
q
&
B
M
,
Q
\$
!
D
C
N
J
.
3
l
p
y
u
n
P
.
U
c
V
h
I
o
i
x
C
N
6
g
X
q
Q
r
z
w
/
y
$
e
7
v
l
-
\194
f
,
n
s
\163
Y
b
J
E
R
t
Non-zero for 11.3% of words.
^GP $____ (2.3) ^GP S$____ (2.3) ^GP s$____ (2.3) ^XVI $____ (2.1) ^YOU $____ (1.8) ^Gi ants$____ (1.8) ^Gi ven$____ (1.8) ^Gi uliani$____ (1.8) ^Gi bbs$____ (1.8) ^CITY$ ____ (1.6) ^gi ve$____ (1.6) ^gi ven$____ (1.6) ^gi ving$____ (1.6) ^gi rl$____ (1.6) ^Belgi um$____ (1.6) ^Belgi an$____ (1.6) ^G$ ____ (1.6) ^Gl obal$____ (1.6) ^Gl asgow$____ (1.6) ^Gl obe$____ (1.6) ^Gl enn$____ (1.6) ^Gl en$____ (1.6) ^Gl oucester$____ (1.6) ^Gl oucestershire$____ (1.6) ^JERU SALEM$____ (1.6) ^MVP $____ (1.5) ^Y$ ____ (1.5) ^NYC $____ (1.5) ^DIEGO $____ (1.5) ^LG$ ____ (1.5) ^gl obal$____ (1.4) ^gl ass$____ (1.4) ^gl obe$____ (1.4) ^gl ad$____ (1.4) ^BEIJING$ ____ (1.4) ^VEGA S$____ (1.4) ^3G$ ____ (1.4) ^G2 0$____ (1.4) ^NY$ ____ (1.4) ^BHP $____ (1.4)
Filter 22 (bias = -0.68)
K
u
5
-
,
g
6
r
D
k
2
b
0
R
c
H
8
h
X
Y
/
m
B
j
W
'
L
\195
N
V
O
.
p
s
\194
i
f
E
9
U
-
A
j
B
:
c
'
a
O
h
;
k
s
T
"
q
J
C
M
t
!
v
%
F
Q
D
n
U
N
0
L
x
b
h
-
F
v
A
w
H
z
S
J
C
T
L
o
7
E
y
M
4
\195
,
u
8
m
x
K
5
b
Q
e
1
O
Y
B
6
t
3
k
l
W
Non-zero for 7.6% of words.
^hip-h op$____ (2.8) ^half-h our$____ (2.7) ^two-h our$____ (2.7) ^15-y ear-old$____ (2.7) ^25-y ear-old$____ (2.7) ^35-y ear-old$____ (2.7) ^5-y ear-old$____ (2.7) ^5-4 $____ (2.6) ^season-h igh$____ (2.6) ^in-h ouse$____ (2.6) ^24-h our$____ (2.6) ^second-h alf$____ (2.5) ^first-h alf$____ (2.5) ^right-h ander$____ (2.5) ^left-h ander$____ (2.5) ^6-7 $____ (2.5) ^D-C alif$____ (2.4) ^16-y ear-old$____ (2.4) ^26-y ear-old$____ (2.4) ^36-y ear-old$____ (2.4) ^5-1 $____ (2.4) ^6-4 $____ (2.4) ^5-3 $____ (2.3) ^22-y ear-old$____ (2.3) ^12-y ear-old$____ (2.3) ^32-y ear-old$____ (2.3) ^2-y ear-old$____ (2.3) ^10-y ear$____ (2.3) ^20-y ear-old$____ (2.3) ^30-y ear-old$____ (2.3) ^30-y ear$____ (2.3) ^10-y ear-old$____ (2.3) ^40-y ear-old$____ (2.3) ^50-y ear-old$____ (2.3) ^18-y ear-old$____ (2.3) ^28-y ear-old$____ (2.3) ^38-y ear-old$____ (2.3) ^6-1 $____ (2.2) ^African-A merican$____ (2.2) ^African-A mericans$____ (2.2)
Filter 23 (bias = -0.43)
U
o
<BOS>
h
z
g
m
N
Q
r
s
j
'
e
P
0
\194
c
V
y
k
1
b
O
"
3
C
q
/
w
^
E
v
n
d
J
a
5
X
H
Y
f
0
.
T
y
1
m
V
F
)
e
G
\$
C
x
i
!
7
?
g
-
J
s
R
N
k
r
4
L
9
'
H
a
\194
%
z
:
6
w
x
u
b
A
V
t
X
r
W
D
v
H
K
g
B
I
m
d
"
-
\194
R
6
j
Q
n
f
s
p
U
Y
E
9
o
8
1
'
h
k
.
Non-zero for 16.6% of words.
^mix ed$____ (2.1) ^mix $____ (2.1) ^mix ture$____ (2.1) ^mix ing$____ (2.1) ^six $____ (2.1) ^six th$____ (2.1) ^six -month$____ (2.1) ^six -year$____ (2.1) ^possib le$____ (2.1) ^responsib le$____ (2.1) ^responsib ility$____ (2.1) ^possib ility$____ (2.1) ^possib ly$____ (2.1) ^impossib le$____ (2.1) ^CCTV $____ (2.0) ^Tb ilisi$____ (2.0) ^TV $____ (2.0) ^TV s$____ (2.0) ^ITV $____ (1.9) ^1b n$____ (1.8) ^massiv e$____ (1.8) ^expensiv e$____ (1.8) ^offensiv e$____ (1.8) ^aggressiv e$____ (1.8) ^SUV$ ____ (1.7) ^UAW $____ (1.6) ^vib rant$____ (1.6) ^credib ility$____ (1.6) ^incredib le$____ (1.6) ^incredib ly$____ (1.6) ^credib le$____ (1.6) ^UC$ ____ (1.6) ^Bib le$____ (1.5) ^sim ilar$____ (1.5) ^sim ply$____ (1.5) ^sim ple$____ (1.5) ^sim ilarly$____ (1.5) ^VW $____ (1.5) ^TB $____ (1.5) ^fix ed$____ (1.4)
Filter 24 (bias = -0.43)
K
-
U
.
P
g
B
h
O
x
,
v
3
d
2
u
M
q
Z
j
z
b
N
l
5
'
/
\194
9
H
f
r
G
e
8
t
J
n
C
Y
S
U
Q
f
Y
m
"
P
(
n
\194
a
:
v
7
w
X
u
z
p
d
B
\195
y
%
K
D
J
c
H
v
/
c
Y
f
7
k
l
T
A
e
i
F
1
b
I
r
X
B
4
M
Q
t
L
E
$
\163
W
D
x
p
j
0
m
Non-zero for 4.9% of words.
^TOKYO $____ (2.5) ^Kha n$____ (2.2) ^Kha menei$____ (2.2) ^Kha rtoum$____ (2.2) ^Kha lid$____ (2.2) ^Kri s$____ (2.1) ^U.K.$ ____ (2.1) ^K.$ ____ (2.1) ^Kel ly$____ (2.0) ^Kel ler$____ (2.0) ^Kei th$____ (2.0) ^USA $____ (2.0) ^JERUSA LEM$____ (2.0) ^US$ ____ (1.9) ^Kil patrick$____ (1.9) ^Kra ft$____ (1.8) ^Kyl e$____ (1.8) ^Kyi $____ (1.8) ^GPS$ ____ (1.7) ^PKK$ ____ (1.7) ^Kea ne$____ (1.7) ^Ky$ ____ (1.7) ^Ken nedy$____ (1.7) ^Ken ya$____ (1.7) ^Ken t$____ (1.7) ^Ken tucky$____ (1.7) ^CBS$ ____ (1.6) ^UBS$ ____ (1.6) ^RBS$ ____ (1.6) ^PBS$ ____ (1.6) ^Khm er$____ (1.6) ^Kon g$____ (1.6) ^LOS$ ____ (1.6) ^HBOS$ ____ (1.6) ^OS$ ____ (1.6) ^Kai ne$____ (1.5) ^Kai ser$____ (1.5) ^Kin g$____ (1.5) ^Kin gdom$____ (1.5) ^Kin gs$____ (1.5)
Filter 25 (bias = -0.39)
Y
.
S
c
i
-
3
d
h
v
l
w
J
'
5
m
o
g
H
b
4
\163
O
n
7
e
6
f
,
z
1
a
B
u
L
s
K
r
9
Z
V
u
Z
D
X
t
g
N
i
d
m
r
w
E
!
F
G
T
Y
R
p
o
W
U
n
f
4
v
5
e
6
\$
b
.
K
c
s
\163
Q
u
"
-
X
h
K
.
P
d
V
t
R
j
Y
w
G
n
k
i
9
g
8
e
O
H
C
l
p
s
$
m
r
L
7
v
'
o
b
D
Non-zero for 21.4% of words.
^Sir $____ (1.4) ^MVP $____ (1.3) ^big$ ____ (1.3) ^Big$ ____ (1.3) ^Craig$ ____ (1.3) ^dig$ ____ (1.3) ^immigr ation$____ (1.3) ^immigr ants$____ (1.3) ^Citigr oup$____ (1.3) ^immigr ant$____ (1.3) ^migr ants$____ (1.3) ^Immigr ation$____ (1.3) ^migr ation$____ (1.3) ^migr ant$____ (1.3) ^Hawaii$ ____ (1.3) ^Wii$ ____ (1.3) ^hik e$____ (1.3) ^hik es$____ (1.3) ^hik ing$____ (1.3) ^imp ortant$____ (1.3) ^imp act$____ (1.3) ^simp ly$____ (1.3) ^imp rove$____ (1.3) ^simp le$____ (1.3) ^him$ ____ (1.2) ^claim$ ____ (1.2) ^Muslim$ ____ (1.2) ^victim$ ____ (1.2) ^TV$ ____ (1.2) ^ITV$ ____ (1.2) ^MTV$ ____ (1.2) ^CCTV$ ____ (1.2) ^lik e$____ (1.2) ^lik ely$____ (1.2) ^unlik ely$____ (1.2) ^lik es$____ (1.2) ^relationship $____ (1.2) ^leadership $____ (1.2) ^ship $____ (1.2) ^championship $____ (1.2)
Filter 26 (bias = -0.46)
.
p
Q
i
N
z
Z
P
H
k
<BOS>
T
X
o
/
J
F
G
\194
v
8
O
q
K
7
m
"
f
^
l
4
U
n
\195
A
B
0
s
v
y
\194
r
x
H
J
U
l
A
0
g
9
?
6
h
B
E
-
F
o
u
X
P
5
O
z
1
V
k
b
\163
W
s
7
C
'
t
Y
i
P
x
C
h
I
-
p
v
Q
u
z
j
U
S
D
J
T
o
O
4
r
f
K
b
c
s
/
e
X
E
y
9
A
5
t
Y
G
6
d
3
Non-zero for 27.1% of words.
^vy ing$____ (2.1) ^v$ ____ (1.8) ^Chevr olet$____ (1.8) ^Chevr on$____ (1.8) ^va lue$____ (1.8) ^va rious$____ (1.8) ^va st$____ (1.8) ^va riety$____ (1.8) ^2.0$ ____ (1.8) ^1.9$ ____ (1.7) ^2.9$ ____ (1.7) ^0.9$ ____ (1.7) ^3.9$ ____ (1.7) ^4.9$ ____ (1.7) ^1.6$ ____ (1.7) ^0.6$ ____ (1.7) ^2.6$ ____ (1.7) ^3.6$ ____ (1.7) ^\194\174 $____ (1.7) ^JP Morgan$____ (1.7) ^JP $____ (1.7) ^Levy $____ (1.7) ^levy $____ (1.7) ^inva sion$____ (1.7) ^inva ded$____ (1.7) ^canva s$____ (1.7) ^inva sive$____ (1.7) ^NBC $____ (1.7) ^MSNBC $____ (1.7) ^CNBC $____ (1.7) ^Ava tar$____ (1.6) ^Favr e$____ (1.6) ^Lavr ov$____ (1.6) ^1.5$ ____ (1.6) ^2.5$ ____ (1.6) ^3.5$ ____ (1.6) ^0.5$ ____ (1.6) ^Ivy $____ (1.5) ^BP $____ (1.5) ^heavy $____ (1.5)
Filter 27 (bias = -0.46)
p
b
s
.
P
B
C
q
d
v
O
w
S
A
y
l
,
h
G
T
8
u
3
J
1
N
Q
m
2
\195
5
M
"
L
F
H
7
Y
I
<BOS>
V
f
Y
t
X
r
Z
F
W
d
H
s
6
o
4
D
i
c
m
N
)
.
G
\$
b
u
M
y
!
e
J
S
7
,
w
O
g
E
3
R
Y
w
\194
g
"
A
V
c
x
I
u
t
6
z
b
O
m
r
h
E
'
n
8
.
9
p
M
o
f
a
v
N
F
e
$
2
7
G
Q
D
Non-zero for 11.9% of words.
^FOX$ ____ (1.7) ^XV I$____ (1.6) ^HIV$ ____ (1.6) ^IV$ ____ (1.6) ^SUV$ ____ (1.5) ^Yu shchenko$____ (1.4) ^six $____ (1.4) ^six th$____ (1.4) ^six -month$____ (1.4) ^six -year$____ (1.4) ^six -party$____ (1.4) ^V$ ____ (1.4) ^Xb ox$____ (1.4) ^opiu m$____ (1.4) ^MOSCOW$ ____ (1.3) ^smu ggling$____ (1.3) ^Rasmu ssen$____ (1.3) ^smu ggled$____ (1.3) ^Pittsbu rgh$____ (1.3) ^Petersbu rg$____ (1.3) ^Johannesbu rg$____ (1.3) ^Sainsbu ry$____ (1.3) ^possib le$____ (1.3) ^responsib le$____ (1.3) ^responsib ility$____ (1.3) ^possib ility$____ (1.3) ^sim ilar$____ (1.3) ^sim ply$____ (1.3) ^sim ple$____ (1.3) ^sim ilarly$____ (1.3) ^sim ultaneously$____ (1.3) ^sim pler$____ (1.3) ^sim ilarities$____ (1.3) ^pessim istic$____ (1.3) ^Zu ma$____ (1.2) ^Zu rich$____ (1.2) ^Six $____ (1.2) ^Y$ ____ (1.2) ^X$ ____ (1.2) ^VW $____ (1.2)
Filter 28 (bias = -0.32)
H
f
4
K
Z
z
V
O
7
t
1
l
n
U
Y
o
W
m
g
P
X
p
q
s
3
D
2
L
6
y
h
F
9
T
-
c
^
S
Q
r
X
h
J
x
Z
y
6
c
5
p
V
a
&
f
Q
r
M
.
I
d
/
k
K
u
2
v
Y
?
\194
b
3
g
7
q
F
\163
s
g
f
G
B
Y
K
V
F
A
N
.
P
Z
o
C
x
Q
,
X
v
z
y
w
U
$
u
j
9
H
e
8
M
D
J
d
Non-zero for 13.8% of words.
^Hig h$____ (1.7) ^Hig hway$____ (1.7) ^Hig her$____ (1.7) ^Hig hland$____ (1.7) ^Hog an$____ (1.5) ^nig ht$____ (1.3) ^overnig ht$____ (1.3) ^tonig ht$____ (1.3) ^nig hts$____ (1.3) ^Veg as$____ (1.3) ^Wig an$____ (1.3) ^gig $____ (1.3) ^Hug hes$____ (1.2) ^Hug o$____ (1.2) ^Hug h$____ (1.2) ^Vog ue$____ (1.2) ^neg otiations$____ (1.2) ^neg ative$____ (1.2) ^neg otiating$____ (1.2) ^neg otiate$____ (1.2) ^Schwarzeneg ger$____ (1.2) ^neg otiated$____ (1.2) ^neg otiators$____ (1.2) ^neg otiator$____ (1.2) ^Eg ypt$____ (1.2) ^Eg yptian$____ (1.2) ^hig h$____ (1.1) ^hig her$____ (1.1) ^hig hest$____ (1.1) ^hig hly$____ (1.1) ^Michig an$____ (1.1) ^hig h-profile$____ (1.1) ^hig hway$____ (1.1) ^hig hlighted$____ (1.1) ^HIV $____ (1.1) ^Hag ue$____ (1.1) ^Sg t.$____ (1.1) ^Sg t$____ (1.1) ^ig nored$____ (1.1) ^ig nore$____ (1.1)
Filter 29 (bias = -0.61)
<BOS>
y
V
-
X
u
Q
t
7
f
Z
p
Y
o
L
v
5
d
G
w
8
i
6
a
9
m
S
c
\194
h
J
q
^
n
4
k
j
T
l
x
s
P
S
T
E
B
w
b
-
m
g
q
j
;
4
p
5
X
\$
l
3
a
e
D
2
L
G
v
O
k
(
\195
o
Y
i
d
M
U
I
y
p
v
y
u
,
-
K
j
X
k
L
R
P
b
/
T
O
J
Q
E
1
.
A
t
2
M
6
g
5
B
G
c
a
w
3
o
W
e
7
h
Non-zero for 15.7% of words.
^17-y ear-old$____ (1.8) ^27-y ear-old$____ (1.8) ^37-y ear-old$____ (1.8) ^7-y ear-old$____ (1.8) ^sp okesman$____(1.7) ^sp ending$____(1.7) ^sp ent$____(1.7) ^sp ecial$____(1.7) ^sy stem$____(1.7) ^sy stems$____(1.7) ^sy mptoms$____(1.7) ^sy mbol$____(1.7) ^sy mpathy$____(1.7) ^sy ndrome$____(1.7) ^15-y ear-old$____ (1.6) ^25-y ear-old$____ (1.6) ^35-y ear-old$____ (1.6) ^5-y ear-old$____ (1.6) ^VEG AS$____ (1.5) ^TVs$ ____ (1.5) ^Sp ain$____(1.5) ^Sp anish$____(1.5) ^Sp ace$____(1.5) ^Sp eaking$____(1.5) ^18-y ear-old$____ (1.5) ^28-y ear-old$____ (1.5) ^38-y ear-old$____ (1.5) ^16-y ear-old$____ (1.5) ^26-y ear-old$____ (1.5) ^36-y ear-old$____ (1.5) ^Sy ria$____(1.5) ^Sy stems$____(1.5) ^Sy dney$____(1.5) ^Sy stem$____(1.5) ^19-y ear-old$____ (1.5) ^29-y ear-old$____ (1.5) ^39-y ear-old$____ (1.5) ^9-y ear-old$____ (1.5) ^Ey e$____(1.5) ^ANGEL ES$____ (1.4)
Filter 30 (bias = -0.45) #
C
u
Z
-
V
v
Q
t
7
T
F
i
G
m
5
o
8
y
X
W
A
w
9
f
L
d
0
E
n
e
c
h
<EOS>
H
<BOS>
U
R
O
S
M
W
j
?
l
"
J
y
n
Q
P
(
0
H
p
!
%
a
D
U
-
:
C
h
f
E
\195
u
z
q
L
.
g
O
R
Y
F
\$
5
&
o
X
o
L
-
V
u
b
i
Z
t
G
f
Q
I
Y
w
g
,
.
n
m
v
A
s
l
k
7
R
F
O
8
p
M
U
6
d
\194
a
"
N
Non-zero for 19.3% of words.
^VW$ ____ (2.0) ^Cab inet$____ (1.9) ^Cab le$____ (1.9) ^Cab rera$____ (1.9) ^anyb ody$____ (1.7) ^cyb er$____ (1.7) ^Cub a$____ (1.7) ^Cub an$____ (1.7) ^Cub s$____ (1.7) ^Cub ans$____ (1.7) ^W. $____(1.7) ^UAW$ ____ (1.6) ^AZUZ $____ (1.6) ^Cam eron$____ (1.5) ^Cam pbell$____ (1.5) ^Cam bridge$____ (1.5) ^Cam p$____ (1.5) ^everyb ody$____ (1.5) ^Everyb ody$____ (1.5) ^WA SHINGTON$____(1.4) ^WA M$____(1.4) ^CCTV $____ (1.4) ^Fab io$____ (1.4) ^Fab regas$____ (1.4) ^Cyr us$____ (1.4) ^Gab riel$____ (1.4) ^VEG AS$____ (1.4) ^Cal ifornia$____ (1.4) ^Cal if$____ (1.4) ^PRNewswire-FirstCal l$____ (1.4) ^Cal l$____ (1.4) ^Cal deron$____ (1.4) ^Cal gary$____ (1.4) ^Cal $____ (1.4) ^anonym ity$____ (1.3) ^anym ore$____ (1.3) ^anonym ous$____ (1.3) ^ANGEL ES$____ (1.3) ^policym akers$____ (1.3) ^FOX $____ (1.3)
Filter 31 (bias = -0.59) #
<BOS>
A
W
.
v
L
V
y
"
n
p
D
9
h
X
u
k
H
Q
t
P
l
6
m
b
N
E
/
2
U
J
F
\194
r
G
o
7
c
x
C
o
H
R
m
9
t
K
h
G
T
8
q
N
i
O
A
5
k
&
a
J
y
0
g
3
u
S
e
7
b
Q
.
z
d
C
v
s
W
c
w
L
v
y
k
/
-
m
j
X
g
Z
w
H
R
Q
i
D
t
K
p
U
x
A
o
.
s
M
E
6
c
8
I
l
J
$
0
r
'
Non-zero for 14.7% of words.
^BERL IN$____ (2.5) ^envoy $____ (2.2) ^convoy $____ (2.2) ^voy age$____ (2.2) ^Wom en$____ (2.0) ^Wom an$____ (2.0) ^vom iting$____ (2.0) ^NL $____(1.9) ^boy $____ (1.8) ^boy s$____ (1.8) ^boy friend$____ (1.8) ^boy cott$____ (1.8) ^Cowboy s$____ (1.8) ^Playboy $____ (1.8) ^Joy ce$____ (1.8) ^ISL AMABAD$____ (1.7) ^psy chological$____ (1.6) ^psy chiatric$____ (1.6) ^autopsy $____ (1.6) ^psy chologist$____ (1.6) ^psy chology$____ (1.6) ^psy chiatrist$____ (1.6) ^bom b$____ (1.6) ^bom bing$____ (1.6) ^bom bs$____ (1.6) ^bom ber$____ (1.6) ^ANGEL ES$____ (1.5) ^Ry an$____(1.5) ^Ry der$____(1.5) ^Ry anair$____(1.5) ^Gom ez$____ (1.5) ^Roy al$____ (1.4) ^Roy $____ (1.4) ^Roy als$____ (1.4) ^PL C$____(1.4) ^LL C$____(1.3) ^LL P$____(1.3) ^19-y ear-old$____ (1.3) ^29-y ear-old$____ (1.3) ^39-y ear-old$____ (1.3)
Filter 32 (bias = -0.55) #
Q
j
O
v
W
k
G
n
<BOS>
F
z
u
S
T
s
h
K
f
"
t
/
H
<EOS>
D
^
q
Y
P
E
M
U
e
d
-
J
r
E
m
r
v
N
C
e
l
O
z
3
'
2
s
Q
n
J
t
9
k
8
x
\$
c
R
i
I
\194
?
U
"
.
X
d
7
V
\163
u
(
A
Y
F
W
c
X
f
H
C
J
p
l
k
w
y
-
t
\195
x
/
s
L
d
$
j
\194
P
e
D
r
S
'
v
\163
Non-zero for 17.1% of words.
^NY SE$____(2.0) ^NY $____(2.0) ^NY C$____(2.0) ^Orl eans$____ (2.0) ^Orl ando$____ (2.0) ^TOKY O$____ (1.8) ^Or$ ____ (1.7) ^ANGEL ES$____ (1.7) ^Ori oles$____ (1.6) ^NW $____(1.6) ^GE$ ____ (1.6) ^WASHINGTON$ ____ (1.6) ^LONDON$ ____ (1.6) ^BOSTON$ ____ (1.6) ^ON$ ____ (1.6) ^Ora nge$____ (1.6) ^Ora cle$____ (1.6) ^Wri ght$____ (1.5) ^Wri ters$____ (1.5) ^Wri ter$____ (1.5) ^Wri ting$____ (1.5) ^El $____(1.5) ^El izabeth$____(1.5) ^El ection$____(1.5) ^El ectric$____(1.5) ^FTSE$ ____ (1.5) ^Wel l$____ (1.5) ^Wel sh$____ (1.5) ^Wel ls$____ (1.5) ^Wel come$____ (1.5) ^Gri ffin$____ (1.4) ^Gra nd$____ (1.4) ^Gra ham$____ (1.4) ^Gra nt$____ (1.4) ^Gra y$____ (1.4) ^Gro up$____ (1.3) ^Gro ss$____ (1.3) ^Gro wth$____ (1.3) ^Gro ve$____ (1.3) ^NH S$____(1.3)
Filter 33 (bias = -0.58) #
Y
w
H
.
h
-
R
z
k
v
V
d
S
a
F
c
M
o
4
p
"
n
3
l
C
x
7
q
Q
e
8
t
i
I
r
\163
P
g
U
<BOS>
b
t
g
f
!
y
V
M
G
,
x
O
.
T
Y
F
7
K
z
o
Z
D
X
U
?
N
Q
i
0
P
:
m
9
e
v
H
-
B
a
u
j
y
J
k
5
h
6
m
9
U
7
H
-
T
X
u
0
i
e
A
\194
r
2
p
l
g
N
a
8
Y
Q
c
E
t
I
C
F
b
3
\163
Non-zero for 21.6% of words.
^Mbe ki$____ (1.6) ^subj ect$____ (1.6) ^subj ects$____ (1.6) ^subj ected$____ (1.6) ^neighbo rhood$____ (1.3) ^neighbo rs$____ (1.3) ^neighbo ring$____ (1.3) ^neighbo rhoods$____ (1.3) ^4.5 $____ (1.3) ^sub- prime$____ (1.3) ^describe d$____ (1.2) ^libe ral$____ (1.2) ^Libe ral$____ (1.2) ^Tibe t$____ (1.2) ^Forbe s$____ (1.2) ^absorbe d$____ (1.2) ^disturbe d$____ (1.2) ^Herbe rt$____ (1.2) ^3.5 $____ (1.2) ^possibl e$____ (1.2) ^responsibl e$____ (1.2) ^possibl y$____ (1.2) ^impossibl e$____ (1.2) ^eligibl e$____ (1.2) ^YouTube $____ (1.2) ^tube $____ (1.2) ^Tube $____ (1.2) ^tube s$____ (1.2) ^marbl e$____ (1.2) ^4.6 $____ (1.2) ^U.S.- led$____ (1.2) ^U.S.- backed$____ (1.2) ^publ ic$____ (1.1) ^Republ ican$____ (1.1) ^publ ished$____ (1.1) ^Republ icans$____ (1.1) ^4.9 $____ (1.1) ^be $____(1.1) ^be en$____(1.1) ^be fore$____(1.1)
Filter 34 (bias = -0.43) #
X
C
2
c
W
k
Z
p
E
'
6
t
<BOS>
o
4
g
e
-
K
u
3
r
N
s
8
R
5
l
Q
h
w
d
9
m
B
T
7
i
L
v
T
x
O
F
t
b
o
n
W
h
M
f
K
a
E
!
D
V
w
s
z
L
&
C
c
7
"
P
v
k
G
A
\194
\$
Y
r
0
p
X
H
Z
t
V
T
b
p
9
y
J
O
G
o
8
h
7
D
R
,
6
c
X
e
Y
f
L
q
Q
r
4
i
U
d
s
S
3
\163
z
I
5
A
Non-zero for 15.7% of words.
^MTV $____ (2.3) ^TV $____(2.1) ^TV s$____(2.1) ^ITV $____ (2.1) ^DETR OIT$____ (1.9) ^basketb all$____ (1.8) ^setb ack$____ (1.8) ^setb acks$____ (1.8) ^Basketb all$____ (1.8) ^Kob e$____ (1.8) ^Tb ilisi$____(1.8) ^Nob el$____ (1.7) ^Nob ody$____ (1.7) ^Nob le$____ (1.7) ^DENV ER$____ (1.7) ^AZUZ $____ (1.7) ^Bob $____ (1.6) ^Bob by$____ (1.6) ^Ob ama$____(1.6) ^Ob viously$____(1.6) ^Ob server$____(1.6) ^Ob amas$____(1.6) ^SEOU L$____ (1.6) ^MV P$____(1.6) ^ET$ ____ (1.6) ^CEOs $____ (1.5) ^newb orn$____ (1.5) ^Job s$____ (1.5) ^Job $____ (1.5) ^Hezb ollah$____ (1.5) ^Mob ile$____ (1.5) ^T-Mob ile$____ (1.5) ^DV D$____(1.4) ^DV Ds$____(1.4) ^CEO$ ____ (1.4) ^ob vious$____(1.3) ^ob tained$____(1.3) ^ob viously$____(1.3) ^ob tain$____(1.3) ^Web $____ (1.3)
Filter 35 (bias = -0.82) #
9
m
R
.
J
y
3
-
E
d
B
p
7
t
8
c
4
g
5
'
N
n
Y
f
S
w
6
v
2
a
0
l
<BOS>
q
K
u
Q
x
V
\163
A
v
H
f
C
b
I
x
(
B
l
m
1
K
g
M
h
w
&
e
/
J
7
)
t
-
D
k
Q
E
Y
9
r
W
:
z
,
;
!
o
F
o
'
J
k
K
x
\195
t
O
s
G
f
L
v
N
C
r
\194
Y
S
D
.
1
V
3
d
R
Q
0
c
E
"
T
m
z
$
Z
n
M
Non-zero for 15.3% of words.
^RAF $____ (4.0) ^IRA$ ____ (3.2) ^ERA$ ____ (3.2) ^PARIS $____ (2.9) ^FARC$ ____ (2.6) ^HMRC$ ____ (2.6) ^IAEA$ ____ (2.6) ^NBA$ ____ (2.6) ^BA$ ____ (2.6) ^MRI$ ____ (2.6) ^NAS A$____ (2.5) ^NAS CAR$____ (2.5) ^NAS DAQ$____ (2.5) ^1991$ ____ (2.5) ^91$ ____ (2.5) ^RBIs $____ (2.5) ^FRAN CISCO$____ (2.4) ^TEHRAN $____ (2.4) ^BCS $____ (2.3) ^EDF $____ (2.3) ^NHS $____ (2.3) ^1997$ ____ (2.3) ^97$ ____ (2.3) ^19th $____ (2.3) ^9th $____ (2.3) ^19th -century$____ (2.3) ^BAA $____ (2.2) ^DNA$ ____ (2.2) ^Els ewhere$____ (2.2) ^Rit chie$____ (2.2) ^Ris k$____ (2.2) ^Ris ing$____ (2.2) ^Jr. $____ (2.2) ^1974 $____ (2.1) ^Elv is$____ (2.1) ^Riv er$____ (2.1) ^Riv era$____ (2.1) ^Riv ers$____ (2.1) ^Riv erside$____ (2.1) ^USA$ ____ (2.1)
Filter 36 (bias = -0.51) #
M
p
H
.
u
z
T
c
4
a
J
x
j
g
B
G
i
d
Y
l
6
s
3
A
V
o
k
r
9
b
W
w
R
\163
\194
y
F
O
1
L
z
f
&
F
.
y
!
M
Q
m
A
k
l
h
G
P
:
i
Y
e
(
H
I
p
g
)
w
B
a
t
R
j
O
T
\195
u
-
n
X
v
G
u
p
t
X
-
Q
j
y
v
K
l
"
s
8
h
O
n
Z
B
c
f
P
J
W
.
V
i
$
A
Y
H
\163
R
7
I
1
k
2
w
Non-zero for 20.7% of words.
^Hap py$____ (2.3) ^May $____ (2.1) ^May or$____ (2.1) ^May be$____ (2.1) ^May o$____ (2.1) ^Hay es$____ (2.1) ^Hay den$____ (2.1) ^Hay ward$____ (2.1) ^Hop e$____ (2.0) ^Hop kins$____ (2.0) ^Hop efully$____ (2.0) ^BAG HDAD$____ (2.0) ^sculp ture$____ (2.0) ^sculp tures$____ (2.0) ^M.$ ____ (1.9) ^G.M.$ ____ (1.9) ^H.$ ____ (1.9) ^Moy es$____ (1.8) ^Mac $____ (1.8) ^Mac y$____ (1.8) ^Mac k$____ (1.8) ^July $____ (1.7) ^truly $____ (1.7) ^FEMA$ ____ (1.7) ^Cruz$ ____ (1.7) ^Kilp atrick$____ (1.7) ^GMAC $____ (1.6) ^4.8 $____ (1.6) ^HIV $____ (1.5) ^Jap an$____ (1.5) ^Jap anese$____ (1.5) ^MIAMI$ ____ (1.5) ^EMI$ ____ (1.5) ^Bap tist$____ (1.5) ^family $____ (1.5) ^daily $____ (1.5) ^easily $____ (1.5) ^heavily $____ (1.5) ^subp rime$____ (1.5) ^AG $____(1.5)
Filter 37 (bias = -0.59) #
-
k
d
f
L
B
7
t
l
F
1
S
X
c
Z
p
.
x
D
K
H
T
6
y
/
v
J
m
I
s
u
b
2
M
Q
,
\194
r
^
i
U
j
K
g
!
h
Q
v
P
S
a
t
;
-
Z
e
z
x
/
0
X
F
L
i
\195
c
&
4
:
5
m
o
y
\$
%
s
b
E
r
n
C
p
Z
f
V
y
9
r
R
e
0
l
U
q
G
h
5
a
z
m
\194
t
8
-
4
x
7
o
c
d
6
O
Y
.
$
i
M
P
s
\163
Non-zero for 15.3% of words.
^AZUZ $____ (2.7) ^UC LA$____(2.4) ^UC $____(2.4) ^10-K$ ____ (2.3) ^PRNewswire-US Newswire$____ (2.0) ^IPC C$____ (2.0) ^LLC $____ (1.9) ^QC $____(1.9) ^PC $____(1.9) ^PC s$____(1.9) ^LPG A$____ (1.8) ^daz zling$____ (1.7) ^candidac y$____ (1.6) ^headac hes$____ (1.6) ^headac he$____ (1.6) ^PR Newswire$____(1.5) ^PR Newswire-FirstCall$____(1.5) ^PR $____(1.5) ^Us ing$____(1.5) ^Us $____(1.5) ^Us e$____(1.5) ^Us ers$____(1.5) ^Us ually$____(1.5) ^LLP$ ____ (1.5) ^LC D$____(1.5) ^Florida$ ____ (1.5) ^Canada$ ____ (1.5) ^agenda$ ____ (1.5) ^Qaeda$ ____ (1.5) ^blaz e$____ (1.4) ^Plaz a$____ (1.4) ^laz y$____ (1.4) ^Blaz ers$____ (1.4) ^Port-au -Prince$____ (1.4) ^co-au thor$____ (1.4) ^das h$____ (1.4) ^Zaz i$____ (1.4) ^UB S$____(1.4) ^La$ ____ (1.3) ^plac e$____ (1.3)
Filter 38 (bias = -0.44) #
K
-
B
.
9
u
G
t
8
d
V
m
P
H
0
n
5
h
3
l
X
g
2
y
6
'
S
q
7
o
k
w
z
j
4
r
Q
/
C
e
o
H
O
F
-
d
S
)
K
1
G
D
Y
C
&
I
%
k
'
A
"
q
s
P
z
n
f
T
x
h
w
a
:
u
N
2
.
t
W
i
Q
m
I
f
E
v
S
-
A
p
O
x
2
n
W
y
X
u
$
c
7
M
4
o
5
h
k
b
d
P
'
J
q
Non-zero for 11.9% of words.
^Kos ovo$____ (2.0) ^HBOS $____ (1.9) ^BOS TON$____ (1.9) ^Go$ ____ (1.8) ^Bos ton$____ (1.7) ^Bos nia$____ (1.7) ^Bos nian$____ (1.7) ^HBO$ ____ (1.6) ^PKK$ ____ (1.5) ^TOKYO $____ (1.5) ^Boa rd$____ (1.5) ^CHICAGO$ ____ (1.4) ^DIEGO$ ____ (1.4) ^CBS$ ____ (1.4) ^UBS$ ____ (1.4) ^RBS$ ____ (1.4) ^PBS$ ____ (1.4) ^Pos t$____ (1.4) ^Pos ted$____ (1.4) ^Pos tal$____ (1.4) ^Pos ada$____ (1.4) ^So$ ____ (1.4) ^Bot h$____ (1.3) ^Bow l$____ (1.3) ^BEI JING$____ (1.3) ^Davydenko$ ____ (1.3) ^Tymoshenko$ ____ (1.3) ^Yushchenko$ ____ (1.3) ^Co$ ____ (1.2) ^DETROI T$____ (1.2) ^Jo$ ____ (1.2) ^PGA $____ (1.2) ^LPGA $____ (1.2) ^U.K.$ ____ (1.2) ^K.$ ____ (1.2) ^IPO$ ____ (1.2) ^Got $____ (1.2) ^Bol ton$____ (1.2) ^Bol ivia$____ (1.2) ^Bol t$____ (1.2)
Filter 39 (bias = -0.53) #
x
R
c
U
p
H
d
<BOS>
l
u
.
Z
o
k
D
J
y
E
h
M
v
V
L
I
t
w
g
3
f
0
S
a
C
Q
-
&
v
Y
p
/
i
Z
d
X
f
G
x
K
e
C
u
"
w
\194
q
L
n
M
J
S
a
V
j
U
I
(
o
A
h
E
g
-
h
J
c
I
T
'
B
s
A
Q
t
R
D
P
y
$
k
j
F
O
v
C
L
q
H
a
0
x
U
N
Non-zero for 2.9% of words.
^18- year-old$____ (1.9) ^team- mate$____ (1.9) ^team- mates$____ (1.9) ^US- led$____ (1.9) ^day- to-day$____ (1.9) ^play- off$____ (1.9) ^28- year-old$____ (1.8) ^D- Calif$____(1.8) ^D- N.Y.$____(1.8) ^38- year-old$____ (1.8) ^15- year-old$____ (1.8) ^17- year-old$____ (1.7) ^U.S.- led$____ (1.7) ^U.S.- backed$____ (1.7) ^full- time$____ (1.7) ^all- time$____ (1.7) ^well- known$____ (1.7) ^All- Star$____ (1.7) ^next- generation$____ (1.7) ^by- election$____ (1.7) ^5- 4$____(1.7) ^5- 0$____(1.7) ^5- 1$____(1.7) ^5- 2$____(1.7) ^7- 6$____(1.7) ^7- 5$____(1.7) ^7- year-old$____(1.7) ^back- to-back$____ (1.7) ^Secretary- General$____ (1.7) ^16- year-old$____ (1.7) ^T- shirts$____(1.7) ^T- shirt$____(1.7) ^T- Mobile$____(1.7) ^25- year-old$____ (1.7) ^27- year-old$____ (1.7) ^built- in$____ (1.7) ^45- year-old$____ (1.6) ^6- 3$____(1.6) ^6- 4$____(1.6) ^6- 2$____(1.6)
Filter 40 (bias = -0.59) #
h
f
Y
E
7
k
\194
U
1
w
H
P
6
m
4
e
5
K
8
z
l
b
n
r
/
B
x
s
^
F
9
t
C
T
3
p
X
M
0
\163
Q
k
X
m
7
t
8
A
6
h
9
i
2
T
"
g
5
o
3
c
4
B
:
y
\194
p
Z
a
(
w
\$
U
E
u
d
r
J
z
!
l
d
k
s
B
F
T
D
b
5
Y
C
m
j
q
7
W
I
o
2
h
6
r
8
M
L
w
G
H
4
y
S
v
0
N
z
K
1
i
E
\195
Non-zero for 15.1% of words.
^published $____ (1.6) ^reached $____ (1.6) ^sched uled$____ (1.6) ^launched $____ (1.6) ^high-d efinition$____ (1.5) ^thes e$____ (1.4) ^Thes e$____ (1.4) ^highes t$____ (1.4) ^Manches ter$____ (1.4) ^787 $____ (1.4) ^175 $____ (1.3) ^165 $____ (1.3) ^Yes $____ (1.3) ^Yes terday$____ (1.3) ^high-s peed$____ (1.2) ^cash-s trapped$____ (1.2) ^195 0s$____ (1.2) ^195 9$____ (1.2) ^195 0$____ (1.2) ^195 3$____ (1.2) ^195 7$____ (1.2) ^195 8$____ (1.2) ^195 5$____ (1.2) ^195 6$____ (1.2) ^QC $____(1.2) ^125 $____ (1.1) ^1970s $____ (1.1) ^70s $____ (1.1) ^197 9$____ (1.0) ^197 6$____ (1.0) ^197 2$____ (1.0) ^197 4$____ (1.0) ^197 8$____ (1.0) ^197 0$____ (1.0) ^75 $____(1.0) ^75 0$____(1.0) ^75 ,000$____(1.0) ^75 0,000$____(1.0) ^85 $____(1.0) ^85 0$____(1.0)
Filter 41 (bias = -0.48) #
p
u
G
.
P
A
O
L
i
h
W
F
X
N
K
t
V
H
2
B
3
U
1
D
z
M
6
T
x
j
Q
E
"
l
'
e
7
r
5
q
Z
x
X
p
!
o
Q
h
?
k
M
g
W
S
/
l
H
v
2
j
K
s
L
R
6
-
(
i
U
r
N
t
w
c
:
f
8
'
y
z
D
p
F
x
u
g
j
w
H
G
M
b
L
o
t
a
\194
W
/
y
U
O
C
c
R
z
I
K
6
r
7
\163
l
i
Z
q
J
h
T
k
Non-zero for 15.4% of words.
^Zu ma$____(2.2) ^Zu rich$____(2.2) ^therapeu tic$____ (2.1) ^BAGHD AD$____ (2.1) ^pau se$____ (1.9) ^compet ition$____ (1.7) ^compet itive$____ (1.7) ^compet e$____ (1.7) ^compet ing$____ (1.7) ^LOND ON$____ (1.6) ^happy$ ____ (1.6) ^therapy$ ____ (1.6) ^copy$ ____ (1.6) ^spy$ ____ (1.6) ^MD C$____(1.6) ^FOX$ ____ (1.6) ^Qu een$____(1.5) ^Qu ote$____(1.5) ^Qu eens$____(1.5) ^Qu inn$____(1.5) ^Qu estion$____(1.5) ^Qu ality$____(1.5) ^Qu eensland$____(1.5) ^Qu estions$____(1.5) ^FOXN ews.com$____ (1.5) ^IMF $____ (1.5) ^HD $____(1.5) ^pat ients$____ (1.5) ^pat ient$____ (1.5) ^pat h$____ (1.5) ^participat e$____ (1.5) ^anticipat ed$____ (1.5) ^pat rol$____ (1.5) ^pat tern$____ (1.5) ^Mu slim$____(1.4) ^Mu rray$____(1.4) ^Mu sharraf$____(1.4) ^Mu seum$____(1.4) ^pm$ ____ (1.4) ^Corp.$ ____ (1.4)
Filter 42 (bias = -0.46) #
Y
F
v
r
u
e
i
f
T
j
W
.
U
Q
\194
N
z
S
m
p
o
g
J
c
1
\163
B
E
H
P
-
b
6
x
0
A
w
y
l
8
Z
-
X
p
Q
f
Y
x
&
v
K
t
/
d
8
g
9
h
L
e
V
j
U
k
6
.
7
s
M
r
"
i
3
q
G
a
(
u
\194
n
G
d
U
f
z
x
K
-
A
v
Z
F
L
p
Y
q
R
t
O
e
\195
'
J
j
o
h
B
\194
E
n
w
.
3
\163
0
k
M
6
T
I
Non-zero for 4.7% of words.
^ATLA NTA$____ (1.6) ^YOU $____ (1.5) ^CITY$ ____ (1.5) ^SEATTLE $____ (1.3) ^AZUZ$ ____ (1.3) ^Buzz $____ (1.3) ^buzz $____ (1.3) ^puzz le$____ (1.3) ^pizz a$____ (1.2) ^TOK YO$____ (1.2) ^THA T$____ (1.2) ^DETRO IT$____ (1.1) ^YOR K$____ (1.1) ^TVs $____ (1.1) ^EBITDA $____ (1.1) ^TV$ ____ (1.0) ^ITV$ ____ (1.0) ^MTV$ ____ (1.0) ^CCTV$ ____ (1.0) ^UK$ ____ (1.0) ^TM$ ____ (1.0) ^THE $____ (0.9) ^TOR ONTO$____ (0.9) ^Zo ne$____(0.9) ^Zo o$____(0.9) ^Arizo na$____ (0.9) ^Verizo n$____ (0.9) ^horizo n$____ (0.9) ^TAR P$____ (0.9) ^WHO $____ (0.8) ^NYC$ ____ (0.8) ^KA BUL$____(0.8) ^al-Qa ida$____ (0.7) ^al-Qa eda$____ (0.7) ^Al-Qa eda$____ (0.7) ^\194\174$ ____ (0.7) ^WASHINGTON $____ (0.7) ^BOSTON $____ (0.7) ^HOUSTON $____ (0.7) ^TB$ ____ (0.7)
Filter 43 (bias = -0.37) #
<BOS>
m
I
y
Q
h
O
u
S
b
\194
x
X
f
R
.
5
g
l
c
7
n
^
v
t
M
,
-
z
Z
/
k
H
a
F
U
g
f
Z
x
!
o
H
S
m
N
?
v
V
B
G
F
X
s
A
E
w
9
Y
R
i
\$
1
t
n
,
)
e
C
O
y
K
M
"
/
8
S
u
V
-
Q
a
F
d
j
o
X
U
M
\195
\194
z
$
y
"
v
4
n
5
L
7
p
1
w
D
J
m
q
i
Non-zero for 15.7% of words.
^HS BC$____(2.2) ^HIV$ ____ (2.1) ^IV$ ____ (2.1) ^Sgt .$____ (2.1) ^Sgt $____ (2.1) ^MIAM I$____ (1.9) ^AIG$ ____ (1.9) ^IMF $____ (1.8) ^CIA$ ____ (1.8) ^FIA$ ____ (1.8) ^PHILADELPHIA$ ____ (1.8) ^AIDS $____ (1.8) ^NHS $____ (1.7) ^In$ ____ (1.7) ^HOUS TON$____ (1.7) ^FDIC$ ____ (1.6) ^ITV $____ (1.6) ^AS $____(1.6) ^HM RC$____(1.5) ^!$ ____(1.5) ^FOX$ ____ (1.5) ^Alge ria$____ (1.5) ^ge t$____(1.4) ^ge tting$____(1.4) ^ge neral$____(1.4) ^ge ts$____(1.4) ^H$ ____(1.4) ^ICC $____ (1.4) ^Rutge rs$____ (1.4) ^USA$ ____ (1.4) ^NASA$ ____ (1.4) ^FSA$ ____ (1.4) ^TSA$ ____ (1.4) ^MRSA$ ____ (1.4) ^XV I$____(1.4) ^film$ ____ (1.3) ^calm$ ____ (1.3) ^Film$ ____ (1.3) ^Palm$ ____ (1.3) ^Int ernational$____ (1.3)
Filter 44 (bias = -0.55) #
n
T
7
c
9
t
H
v
a
m
Q
o
I
O
Z
g
4
M
8
D
6
G
3
y
V
z
A
-
N
E
5
p
2
e
X
\163
B
j
F
d
K
d
J
F
o
t
;
h
O
C
b
D
B
s
w
A
Y
u
X
g
\195
.
W
H
M
c
N
\$
G
j
%
y
9
n
r
1
R
I
"
)
S
-
Q
m
F
p
C
i
\194
\195
s
b
5
u
t
P
$
y
c
J
"
r
8
o
4
H
7
q
,
w
/
a
A
l
N
g
h
T
Non-zero for 18.3% of words.
^diagnos ed$____ (2.0) ^nos e$____ (2.0) ^diagnos is$____ (2.0) ^Buenos $____ (2.0) ^HBOS $____ (1.9) ^BOS TON$____ (1.9) ^LOS $____ (1.9) ^NYS E$____ (1.9) ^YORK$ ____ (1.9) ^Hos pital$____ (1.8) ^Hos pitals$____ (1.8) ^Hos sein$____ (1.8) ^chaos $____ (1.8) ^RBS $____ (1.8) ^K$ ____(1.8) ^not $____ (1.7) ^anot her$____ (1.7) ^not hing$____ (1.7) ^cannot $____ (1.7) ^no$ ____ (1.7) ^Leno$ ____ (1.7) ^casino$ ____ (1.7) ^piano$ ____ (1.7) ^Latino$ ____ (1.7) ^knoc ked$____ (1.7) ^innoc ent$____ (1.7) ^genoc ide$____ (1.7) ^knoc k$____ (1.7) ^OS $____(1.7) ^IRS $____ (1.7) ^BS T$____(1.7) ^CBS $____ (1.7) ^PBS $____ (1.6) ^IOC $____ (1.6) ^Hot el$____ (1.5) ^Hot $____ (1.5) ^Hot els$____ (1.5) ^chaot ic$____ (1.5) ^KA BUL$____(1.5) ^IMF $____ (1.5)
Filter 45 (bias = -0.56) #
S
J
y
v
t
-
s
b
h
Z
F
\195
,
0
O
n
A
9
f
X
U
V
"
w
E
<BOS>
.
z
H
P
c
q
'
B
Q
6
u
<EOS>
d
7
Q
h
K
g
P
v
X
j
I
-
/
x
Z
u
N
s
!
l
,
S
2
m
8
t
;
i
U
.
&
b
O
k
9
o
"
T
3
e
W
c
Q
U
X
k
W
z
"
G
\194
B
I
c
q
C
e
R
-
o
$
s
d
K
.
x
t
b
H
J
2
A
L
\195
p
f
n
Non-zero for 7.8% of words.
^PRNewswire-USNe wswire$____ (1.8) ^NASDAQ$ ____ (1.6) ^AFP$ ____ (1.5) ^ESPN $____ (1.5) ^LSU$ ____ (1.5) ^OK$ ____ (1.5) ^UK$ ____ (1.3) ^GOP$ ____ (1.3) ^FOX $____ (1.3) ^FOX News.com$____ (1.3) ^AP$ ____ (1.2) ^GAAP$ ____ (1.2) ^non-GAAP$ ____ (1.2) ^SAP$ ____ (1.2) ^Sae ed$____ (1.2) ^UPI $____ (1.2) ^DAX$ ____ (1.2) ^USC$ ____ (1.2) ^Sad dam$____ (1.1) ^Sad r$____ (1.1) ^al-Sad r$____ (1.1) ^Sad ly$____ (1.1) ^U.K. $____ (1.1) ^DUP$ ____ (1.1) ^Sat urday$____ (1.1) ^Sat urn$____ (1.1) ^Q$ ____(1.1) ^Q. $____(1.0) ^TSB$ ____ (1.0) ^tyre s$____ (1.0) ^Syd ney$____ (1.0) ^Stre et$____ (1.0) ^tre atment$____ (1.0) ^stre et$____ (1.0) ^centre $____ (1.0) ^OPE C$____ (0.9) ^Kenya$ ____ (0.9) ^Zelaya$ ____ (0.9) ^Libya$ ____ (0.9) ^Chechnya$ ____ (0.9)
Filter 46 (bias = -0.66) #
<BOS>
A
-
g
'
D
\194
L
"
T
W
c
Q
E
^
G
f
h
/
0
F
a
z
r
e
p
l
B
U
1
v
A
-
r
\194
y
W
F
X
L
"
U
6
C
J
%
9
h
q
H
w
s
x
g
0
S
;
,
V
a
'
P
T
\$
Y
N
:
D
M
c
Q
-
/
p
L
i
A
g
F
w
U
v
N
J
B
j
\194
d
"
x
K
o
.
e
X
1
C
n
D
E
$
u
,
0
Z
h
S
G
8
2
Non-zero for 6.3% of words.
^--$ ____ (2.5) ^---$ ____ (2.5) ^v. $____(2.4) ^v$ ____(2.4) ^Gov. $____ (2.1) ^Nov$ ____ (2.0) ^Berbatov$ ____ (2.0) ^Lavrov$ ____ (2.0) ^vy ing$____(1.9) ^Aviv$ ____ (1.8) ^'ve $____ (1.7) ^va lue$____(1.7) ^va rious$____(1.7) ^va st$____(1.7) ^va riety$____(1.7) ^vs .$____(1.7) ^vs $____(1.7) ^African-A merican$____ (1.7) ^African-A mericans$____ (1.7) ^-$ ____(1.6) ^savvy $____ (1.6) ^WA SHINGTON$____(1.5) ^WA M$____(1.5) ^anti-A merican$____ (1.4) ^al-Q aida$____ (1.4) ^al-Q aeda$____ (1.4) ^Al-Q aeda$____ (1.4) ^Wi-F i$____ (1.4) ^7-6$ ____ (1.3) ^4-6$ ____ (1.3) ^3-6$ ____ (1.3) ^Tsva ngirai$____ (1.3) ^approva l$____ (1.3) ^innova tive$____ (1.3) ^innova tion$____ (1.3) ^remova l$____ (1.3) ^inva sion$____ (1.3) ^inva ded$____ (1.3) ^canva s$____ (1.3) ^inva sive$____ (1.3)
Filter 47 (bias = -0.37) #
9
m
7
y
5
.
3
g
N
U
2
c
4
b
8
u
I
T
6
p
<BOS>
k
S
z
Q
-
E
'
R
M
0
v
1
d
J
t
W
l
,
L
Y
n
T
f
v
r
\194
.
G
?
S
F
W
N
0
!
"
-
O
w
z
Z
V
A
;
H
6
\$
c
I
X
L
h
u
x
a
i
e
o
y
Q
i
.
p
"
T
/
w
L
k
\194
J
'
v
$
o
s
t
Z
h
X
1
U
B
8
j
n
H
0
e
g
q
2
Non-zero for 11.1% of words.
^NY$ ____ (4.1) ^IT$ ____ (3.5) ^DETROIT$ ____ (3.5) ^CIT$ ____ (3.5) ^NYS E$____ (3.4) ^90$ ____ (3.3) ^1990$ ____ (3.3) ^190$ ____ (3.3) ^NYC $____ (3.3) ^3G$ ____ (3.2) ^T. $____(3.2) ^BEIJING$ ____ (3.2) ^BST$ ____ (3.2) ^EST$ ____ (3.2) ^1990s $____ (3.2) ^90s $____ (3.2) ^mid-1990s $____ (3.2) ^ET$ ____ (3.1) ^ISL AMABAD$____ (3.1) ^70$ ____ (3.1) ^1970$ ____ (3.1) ^170$ ____ (3.1) ^270$ ____ (3.1) ^No. $____ (3.1) ^NW$ ____ (3.0) ^v. $____(3.0) ^50$ ____ (3.0) ^150$ ____ (3.0) ^250$ ____ (3.0) ^350$ ____ (3.0) ^450$ ____ (3.0) ^750$ ____ (3.0) ^2050$ ____ (3.0) ^1950$ ____ (3.0) ^AIG$ ____ (3.0) ^1970s $____ (3.0) ^70s $____ (3.0) ^30$ ____ (3.0) ^130$ ____ (3.0) ^2030$ ____ (3.0)
Filter 48 (bias = -0.37) #
D
f
R
y
<BOS>
w
\194
p
7
m
j
K
0
s
J
x
Y
a
v
O
T
W
9
i
Q
S
u
o
d
,
X
e
I
E
l
U
C
A
6
c
Q
i
.
p
N
h
?
H
&
1
(
g
"
k
/
d
!
l
K
P
M
4
c
J
'
x
Z
j
r
a
\194
%
X
u
O
T
R
v
:
0
Q
g
"
j
W
D
B
d
N
-
X
p
\194
c
K
i
b
0
$
G
/
l
'
1
9
J
Y
e
T
t
h
n
C
o
Non-zero for 9.5% of words.
^D.$ ____ (3.3) ^Ph.D.$ ____ (3.3) ^R.$ ____ (3.3) ^2007.$ ____ (3.1) ^Maj.$ ____ (3.1) ^J.$ ____ (3.0) ^N.J.$ ____ (3.0) ^ACORN$ ____ (2.9) ^N.Y.$ ____ (2.9) ^D-N.Y.$ ____ (2.9) ^Gov.$ ____ (2.9) ^Rev.$ ____ (2.9) ^v.$ ____ (2.9) ^T.$ ____ (2.9) ^2009.$ ____ (2.9) ^Q.$ ____ (2.8) ^NW $____(2.8) ^Ltd.$ ____ (2.8) ^CNB C$____ (2.8) ^0.9 $____ (2.8) ^I.$ ____ (2.8) ^Col.$ ____ (2.8) ^council.$ ____ (2.8) ^D.C.$ ____ (2.7) ^C.$ ____ (2.7) ^N.C.$ ____ (2.7) ^S.C.$ ____ (2.7) ^0.8 $____ (2.7) ^NB C$____(2.6) ^NB A$____(2.6) ^Q$ ____(2.6) ^.$ ____(2.5) ^2008.$ ____ (2.5) ^H.$ ____ (2.5) ^V.$ ____ (2.5) ^CNN $____ (2.5) ^IN$ ____ (2.4) ^BERLIN$ ____ (2.4) ^al-Qa ida$____ (2.4) ^al-Qa eda$____ (2.4)
Filter 49 (bias = -0.59) #
X
u
2
h
<BOS>
.
I
o
6
c
W
U
V
T
3
l
Q
v
5
s
Z
A
4
L
7
m
P
z
9
D
K
g
n
t
^
r
1
b
8
d
x
u
o
j
c
-
K
H
W
P
a
t
8
F
5
U
.
M
0
E
9
I
B
i
N
e
z
T
G
k
X
s
Y
d
\194
r
!
m
q
;
d
h
v
f
c
r
D
H
z
b
T
L
0
u
p
m
I
N
t
M
\194
F
\163
Y
C
R
2
A
X
n
Q
x
G
J
1
S
W
B
O
s
Non-zero for 27.3% of words.
^Xav ier$____ (1.7) ^Vod afone$____ (1.5) ^Wad e$____ (1.3) ^Wac hovia$____ (1.3) ^exc hange$____ (1.3) ^exc ept$____ (1.3) ^exc ellent$____ (1.3) ^exc lusive$____ (1.3) ^iPod $____ (1.3) ^Zac h$____ (1.2) ^2nd $____ (1.2) ^22nd $____ (1.2) ^nod $____ (1.2) ^od ds$____(1.2) ^od d$____(1.2) ^nov el$____ (1.2) ^innov ative$____ (1.2) ^innov ation$____ (1.2) ^nov els$____ (1.2) ^ov er$____(1.2) ^ov erall$____(1.2) ^ov erseas$____(1.2) ^ov ernight$____(1.2) ^ov erhaul$____(1.2) ^knoc ked$____ (1.2) ^innoc ent$____ (1.2) ^genoc ide$____ (1.2) ^knoc k$____ (1.2) ^oc curred$____(1.2) ^oc cur$____(1.2) ^oc casion$____(1.2) ^oc ean$____(1.2) ^oc cupied$____(1.2) ^oc casionally$____(1.2) ^Waz iristan$____ (1.2) ^sixt h$____ (1.2) ^mixt ure$____ (1.2) ^fixt ure$____ (1.2) ^fixt ures$____ (1.2) ^period $____ (1.1)
Filter 50 (bias = -0.62) #
w
l
y
\194
f
Y
K
D
m
7
c
T
r
j
.
d
Z
0
n
6
N
J
M
v
s
R
U
h
a
<BOS>
e
9
g
1
p
u
k
X
E
L
Q
f
W
j
a
-
A
m
X
p
I
c
:
M
&
o
(
v
!
x
2
n
H
'
q
F
E
g
U
k
"
s
N
C
Y
J
/
y
i
W
C
X
d
Q
s
"
p
M
D
Y
c
w
P
$
R
\194
j
m
n
Z
g
K
u
V
F
q
r
/
l
b
z
0
U
k
I
Non-zero for 11.4% of words.
^UAW $____ (2.8) ^Iowa$ ____ (2.8) ^Ottawa$ ____ (2.8) ^renewab le$____ (2.4) ^NEW $____ (2.4) ^Kenya$ ____ (2.2) ^Zelaya$ ____ (2.2) ^Libya$ ____ (2.2) ^Chechnya$ ____ (2.2) ^Uefa$ ____ (2.1) ^Fifa$ ____ (2.1) ^sofa$ ____ (2.1) ^Hatoyam a$____ (2.1) ^fam ily$____ (2.0) ^fam ilies$____ (2.0) ^fam ous$____ (2.0) ^fam iliar$____ (2.0) ^payab le$____ (1.9) ^fab ric$____ (1.8) ^fab ulous$____ (1.8) ^Obama$ ____ (1.7) ^drama$ ____ (1.7) ^Oklahoma$ ____ (1.7) ^Alabama$ ____ (1.7) ^America$ ____ (1.7) ^Africa$ ____ (1.7) ^Jessica$ ____ (1.7) ^Monica$ ____ (1.7) ^draw $____ (1.7) ^draw n$____ (1.7) ^draw ing$____ (1.7) ^withdraw al$____ (1.7) ^withdraw $____ (1.7) ^raw $____ (1.7) ^unaw are$____ (1.6) ^runaw ay$____ (1.6) ^way $____ (1.6) ^away $____ (1.6) ^alway s$____ (1.6) ^way s$____ (1.6)
Filter 51 (bias = -0.61) #
2
k
1
g
D
b
d
m
I
V
N
h
6
S
3
x
8
Y
9
r
/
'
,
p
W
j
u
A
5
l
o
G
K
F
-
c
O
s
U
C
z
F
K
g
&
h
U
j
9
e
;
\$
B
H
W
.
R
y
Y
f
o
n
v
t
J
S
\194
A
O
r
\195
?
"
m
X
s
Q
x
T
M
o
k
O
F
K
g
N
m
/
v
Y
t
Q
b
"
e
\194
j
9
i
$
h
X
p
8
u
R
f
W
d
3
H
5
T
G
P
s
V
Non-zero for 10.7% of words.
^eurozo ne$____ (2.3) ^zo ne$____(2.0) ^zo nes$____(2.0) ^zo o$____(2.0) ^Amazo n$____ (2.0) ^Amazo n.com$____ (2.0) ^doo r$____ (1.8) ^doo rs$____ (1.8) ^outdoo r$____ (1.8) ^indoo r$____ (1.8) ^doo med$____ (1.8) ^outdoo rs$____ (1.8) ^indoo rs$____ (1.8) ^advo cates$____ (1.8) ^advo cate$____ (1.8) ^advo cacy$____ (1.8) ^advo cated$____ (1.8) ^Ivo ry$____ (1.8) ^Ko rea$____(1.7) ^Ko rean$____(1.7) ^Ko ng$____(1.7) ^Ko sovo$____(1.7) ^Ko reans$____(1.7) ^Ko be$____(1.7) ^Arizo na$____ (1.6) ^Verizo n$____ (1.6) ^horizo n$____ (1.6) ^Cruz$ ____ (1.6) ^Woo ds$____ (1.5) ^Woo d$____ (1.5) ^Woo dward$____ (1.5) ^Woo dy$____ (1.5) ^provo ked$____ (1.4) ^provo cative$____ (1.4) ^provo ke$____ (1.4) ^29$ ____ (1.3) ^TOKY O$____ (1.3) ^LONDON $____ (1.3) ^Buzz $____ (1.3) ^buzz $____ (1.3)
Filter 52 (bias = -0.78) #
d
k
y
b
D
B
L
v
1
R
H
<BOS>
2
V
/
x
O
'
P
j
6
S
X
f
p
r
,
s
W
c
a
J
3
F
I
w
l
9
U
g
u
j
Y
F
v
e
B
f
a
p
U
P
h
g
W
M
T
\$
o
-
R
S
z
5
\194
r
&
G
q
E
"
X
k
I
A
Z
x
m
9
J
Q
f
7
m
X
o
6
w
4
.
8
c
"
t
\194
n
2
y
Y
B
9
x
$
A
1
r
V
K
3
a
I
p
5
k
H
v
b
z
Non-zero for 12.0% of words.
^Hindu$ ____ (2.7) ^du$ ____ (2.7) ^Florida$ ____ (2.4) ^Canada$ ____ (2.4) ^agenda$ ____ (2.4) ^Qaeda$ ____ (2.4) ^Kenya$ ____ (2.4) ^Zelaya$ ____ (2.4) ^Libya$ ____ (2.4) ^Chechnya$ ____ (2.4) ^NASDAQ $____ (2.1) ^do$ ____ (2.0) ^Colorado$ ____ (2.0) ^Orlando$ ____ (2.0) ^Fernando$ ____ (2.0) ^Tokyo$ ____ (2.0) ^Mayo$ ____ (2.0) ^Da$ ____ (1.9) ^Baghdad $____ (1.9) ^dad $____ (1.9) ^Trinidad $____ (1.9) ^Hu$ ____ (1.9) ^La$ ____ (1.8) ^DAX $____ (1.8) ^197 0s$____ (1.7) ^197 9$____ (1.7) ^197 6$____ (1.7) ^197 2$____ (1.7) ^EDT$ ____ (1.6) ^1.7 $____ (1.6) ^Do$ ____ (1.6) ^Ltd.$ ____ (1.6) ^dragonfly.$ ____ (1.6) ^Dad $____ (1.5) ^Hud son$____ (1.4) ^due $____ (1.4) ^subdue d$____ (1.4) ^Purdue $____ (1.4) ^overdue $____ (1.4) ^196 0s$____ (1.4)
Filter 53 (bias = -0.49) #
<BOS>
p
j
P
S
a
.
K
M
o
s
\195
F
l
g
B
'
y
Q
1
t
z
E
x
V
i
\194
L
"
d
-
J
4
q
e
T
^
U
Z
,
z
h
R
F
-
y
v
L
k
f
'
S
I
\$
Q
H
;
e
\195
x
b
4
&
A
!
5
V
%
r
,
P
3
w
M
C
i
T
N
\194
6
Q
f
Y
x
X
v
Z
p
G
k
/
t
L
n
$
-
"
e
U
F
D
i
V
h
H
o
s
w
B
a
q
j
d
Non-zero for 8.8% of words.
^Schwartz$ ____ (2.1) ^Rodriguez$ ____ (1.9) ^Chavez$ ____ (1.9) ^Gonzalez$ ____ (1.9) ^Ramirez$ ____ (1.9) ^risk$ ____ (1.8) ^ask$ ____ (1.8) ^task$ ____ (1.8) ^desk$ ____ (1.8) ^BERL IN$____ (1.8) ^DENVER$ ____ (1.7) ^]$ ____(1.7) ^FRA NCISCO$____ (1.6) ^Cruz$ ____ (1.6) ^R$ ____(1.6) ^MIAMI$ ____ (1.5) ^EMI$ ____ (1.5) ^JERU SALEM$____ (1.5) ^McQ ueen$____ (1.5) ^--$ ____ (1.4) ^---$ ____ (1.4) ^HMRC $____ (1.4) ^-$ ____(1.4) ^Condoleezz a$____ (1.4) ^Medvedev$ ____ (1.4) ^Kiev$ ____ (1.4) ^v$ ____(1.3) ^week$ ____ (1.3) ^seek$ ____ (1.3) ^Greek$ ____ (1.3) ^Week$ ____ (1.3) ^Derek$ ____ (1.3) ^Creek$ ____ (1.3) ^two-week$ ____ (1.3) ^k$ ____(1.3) ^ERA $____ (1.3) ^Fitzg erald$____ (1.2) ^PRNewswire-U SNewswire$____ (1.2) ^USC$ ____ (1.2) ^MRI $____ (1.2)
Filter 54 (bias = -0.42) #
9
m
4
t
2
.
3
-
8
'
1
f
7
l
6
r
5
p
0
<BOS>
E
o
J
k
Z
y
G
b
H
T
U
g
R
v
L
w
D
c
X
O
X
h
V
L
Q
l
W
x
M
.
Z
f
)
d
"
u
;
a
k
%
w
y
T
A
G
o
K
s
\194
F
2
r
n
\$
-
m
P
.
D
x
J
h
G
q
U
w
L
t
K
v
Z
a
8
k
3
-
6
W
1
b
7
'
0
g
9
n
2
f
R
A
5
o
X
m
M
c
Non-zero for 5.5% of words.
^1998 $____ (2.4) ^1920 s$____ (2.4) ^220 $____ (2.3) ^1993 $____ (2.3) ^1996 $____ (2.3) ^1991 $____ (2.3) ^3-D $____ (2.3) ^1968 $____ (2.3) ^1997 $____ (2.3) ^1948 $____ (2.3) ^1990 s$____ (2.3) ^1990 $____ (2.3) ^mid-1990 s$____ (2.3) ^1999 $____ (2.3) ^203 0$____ (2.3) ^401 $____ (2.3) ^201 0$____ (2.3) ^201 1$____ (2.3) ^201 2$____ (2.3) ^201 3$____ (2.3) ^201 4$____ (2.3) ^201 5$____ (2.3) ^201 6$____ (2.3) ^201 8$____ (2.3) ^225 $____ (2.3) ^1963 $____ (2.3) ^1966 $____ (2.3) ^1961 $____ (2.3) ^198 0s$____ (2.3) ^198 9$____ (2.3) ^198 8$____ (2.3) ^198 0$____ (2.3) ^198 4$____ (2.3) ^198 6$____ (2.3) ^1943 $____ (2.2) ^1946 $____ (2.2) ^1941 $____ (2.2) ^1978 $____ (2.2) ^1992 $____ (2.2) ^1958 $____ (2.2)
Filter 55 (bias = -0.67) #
m
E
'
2
f
3
.
1
k
9
l
N
b
4
y
0
t
J
p
7
s
5
U
8
<BOS>
I
V
R
M
6
/
e
z
H
C
q
v
o
P
h
N
m
I
y
9
g
Q
i
7
p
(
k
8
h
R
%
2
s
5
b
q
f
\$
G
X
M
3
-
&
U
0
u
6
'
\194
T
1
Y
D
x
G
A
Q
h
P
t
"
B
O
n
E
q
-
H
X
a
8
.
V
l
p
u
Z
N
$
v
z
,
R
x
s
L
3
i
'
T
K
k
S
w
Non-zero for 10.3% of words.
^ANG ELES$____ (2.7) ^AIG $____ (2.4) ^WASHING TON$____ (2.2) ^BEIJING $____ (2.2) ^'Ne al$____ (2.2) ^'Ne ill$____ (2.2) ^NO T$____(2.0) ^NO $____(2.0) ^BNP $____ (2.0) ^UN$ ____ (1.9) ^SNP $____ (1.9) ^NE W$____(1.9) ^autumn$ ____ (1.8) ^column$ ____ (1.8) ^condemn$ ____ (1.8) ^ESPN$ ____ (1.8) ^1.9$ ____ (1.8) ^2.9$ ____ (1.8) ^0.9$ ____ (1.8) ^3.9$ ____ (1.8) ^4.9$ ____ (1.8) ^5.9$ ____ (1.8) ^columns $____ (1.8) ^IP CC$____(1.7) ^IP $____(1.7) ^IP O$____(1.7) ^IO C$____(1.6) ^N$ ____(1.6) ^same- sex$____ (1.6) ^prime- time$____ (1.6) ^same- store$____ (1.6) ^XVI$ ____ (1.6) ^MIAMI$ ____ (1.5) ^EMI$ ____ (1.5) ^3G $____(1.5) ^1.7$ ____ (1.5) ^2.7$ ____ (1.5) ^0.7$ ____ (1.5) ^3.7$ ____ (1.5) ^UPI$ ____ (1.5)
Filter 56 (bias = -0.60) #
-
F
.
C
<BOS>
k
w
h
z
B
E
P
u
p
'
T
Q
1
W
0
m
i
/
D
<EOS>
S
s
c
I
,
o
y
a
8
4
7
H
(
m
7
v
Q
k
\$
z
4
T
3
U
5
B
j
b
N
c
8
p
:
a
2
t
S
f
6
y
I
P
9
u
X
%
1
K
H
x
?
)
K
t
6
.
P
j
2
r
a
g
3
c
z
M
U
F
W
n
p
T
8
h
i
k
9
'
1
e
x
N
G
u
5
D
7
-
,
q
J
A
Non-zero for 8.2% of words.
^al-Qa ida$____ (2.6) ^al-Qa eda$____ (2.6) ^Al-Qa eda$____ (2.6) ^6-7$ ____ (2.1) ^al-Sa dr$____ (2.1) ^6-4$ ____ (1.9) ^5-4$ ____ (1.9) ^3-4$ ____ (1.9) ^6-3$ ____ (1.9) ^4-3$ ____ (1.9) ^2-3$ ____ (1.9) ^1-3$ ____ (1.9) ^5-3$ ____ (1.9) ^3-3$ ____ (1.9) ^1.7$ ____ (1.8) ^2.7$ ____ (1.8) ^0.7$ ____ (1.8) ^3.7$ ____ (1.8) ^7-5$ ____ (1.7) ^mid-19 90s$____ (1.7) ^1.25 $____ (1.7) ^1.4$ ____ (1.6) ^2.4$ ____ (1.6) ^0.4$ ____ (1.6) ^3.4$ ____ (1.6) ^1.3$ ____ (1.6) ^0.3$ ____ (1.6) ^2.3$ ____ (1.6) ^3.3$ ____ (1.6) ^6-2$ ____ (1.5) ^3-2$ ____ (1.5) ^2-2$ ____ (1.5) ^1-2$ ____ (1.5) ^4-2$ ____ (1.5) ^south-ea st$____ (1.5) ^1.5$ ____ (1.4) ^2.5$ ____ (1.4) ^3.5$ ____ (1.4) ^0.5$ ____ (1.4) ^PRNewswire-Fi rstCall$____ (1.4)
Filter 57 (bias = -0.39) #
S
u
<BOS>
n
Q
m
O
U
j
-
G
v
Y
a
5
y
\194
w
F
\195
E
d
"
H
s
q
7
Z
t
P
g
B
^
1
l
J
V
b
i
\194
p
6
y
D
r
v
m
X
k
9
g
0
i
7
f
8
s
5
w
Q
%
(
P
&
a
4
-
N
O
2
?
J
'
B
U
j
n
"
b
J
y
z
h
B
p
K
d
R
f
9
m
N
i
Z
x
X
t
\195
H
Q
-
0
F
5
'
E
g
I
u
L
k
A
c
G
e
w
\163
Y
q
Non-zero for 6.4% of words.
^NASDA Q$____ (2.1) ^USDA $____ (2.1) ^MSNB C$____ (1.9) ^Suz uki$____ (1.7) ^ISLA MABAD$____ (1.6) ^SEA TTLE$____ (1.5) ^TSB$ ____ (1.4) ^Sel ect$____ (1.4) ^BST$ ____ (1.4) ^EST$ ____ (1.4) ^BOSTO N$____ (1.4) ^HOUSTO N$____ (1.4) ^Seb astian$____ (1.3) ^Seb elius$____ (1.3) ^NYSE$ ____ (1.3) ^FTSE$ ____ (1.3) ^SEO UL$____ (1.3) ^Seo ul$____ (1.2) ^WASHI NGTON$____ (1.2) ^NASCA R$____ (1.2) ^HSBC $____ (1.2) ^Sul livan$____ (1.1) ^U.S.$ ____ (1.1) ^S.$ ____ (1.1) ^SAN $____ (1.1) ^SNP $____ (1.1) ^FOXN ews.com$____ (1.1) ^Slo vakia$____ (1.1) ^Slo venia$____ (1.1) ^Sub s$____ (1.1) ^DJ $____(1.1) ^USS$ ____ (1.0) ^Sea $____ (1.0) ^Sea ttle$____ (1.0) ^Sea n$____ (1.0) ^Sea rch$____ (1.0) ^SEC $____ (1.0) ^ESPN $____ (1.0) ^St$ ____ (1.0) ^PRNewswire-USNe wswire$____ (1.0)
Filter 58 (bias = -0.49) #
V
T
<BOS>
t
Z
y
x
O
\194
o
F
r
b
i
Q
u
9
p
7
E
6
w
X
H
8
I
5
D
'
-
C
a
4
d
<EOS>
h
^
\163
L
q
R
y
s
p
V
D
-
T
'
h
J
c
9
q
k
e
\194
?
)
m
S
H
j
L
z
t
&
d
Y
O
Q
A
b
\163
u
K
C
o
7
1
O
v
N
k
/
b
K
j
y
x
o
g
Q
T
,
F
Z
h
3
u
W
V
X
s
r
B
$
t
L
E
n
e
d
0
i
J
Non-zero for 16.5% of words.
^TVs$ ____ (1.5) ^abso lutely$____ (1.4) ^abso lute$____ (1.4) ^Gibso n$____ (1.4) ^abso rb$____ (1.4) ^Ry an$____(1.2) ^Ry der$____(1.2) ^Ry anair$____(1.2) ^six-y ear$____ (1.1) ^sy stem$____(1.1) ^sy stems$____(1.1) ^sy mptoms$____(1.1) ^sy mbol$____(1.1) ^sy mpathy$____(1.1) ^Ro bert$____(1.1) ^Ro yal$____(1.1) ^Ro ad$____(1.1) ^Ro ck$____(1.1) ^so me$____(1.0) ^so $____(1.0) ^so mething$____(1.0) ^so n$____(1.0) ^so on$____(1.0) ^so cial$____(1.0) ^19-y ear-old$____ (1.0) ^29-y ear-old$____ (1.0) ^39-y ear-old$____ (1.0) ^9-y ear-old$____ (1.0) ^17-y ear-old$____ (1.0) ^27-y ear-old$____ (1.0) ^37-y ear-old$____ (1.0) ^7-y ear-old$____ (1.0) ^Pennsy lvania$____ (1.0) ^16-y ear-old$____ (1.0) ^26-y ear-old$____ (1.0) ^36-y ear-old$____ (1.0) ^jobs$ ____ (1.0) ^clubs$ ____ (1.0) ^bombs$ ____ (1.0) ^Gibbs$ ____ (1.0)
Filter 59 (bias = -0.27) #
.
J
'
K
-
z
<BOS>
2
f
E
h
U
x
0
t
G
"
B
\194
1
Q
P
r
3
m
L
n
5
F
D
q
T
y
\195
S
i
6
9
)
t
Z
o
V
l
6
r
8
A
X
.
4
a
7
%
9
O
2
u
3
N
5
T
M
-
G
h
C
f
1
z
0
q
Q
\195
F
s
"
I
W
J
Q
u
S
-
"
j
Y
n
X
D
\194
R
$
P
O
d
\195
0
r
9
1
F
v
e
Z
U
f
Non-zero for 11.4% of words.
^VW $____(3.3) ^1.6$ ____ (3.2) ^0.6$ ____ (3.2) ^2.6$ ____ (3.2) ^3.6$ ____ (3.2) ^1.8$ ____ (3.1) ^2.8$ ____ (3.1) ^0.8$ ____ (3.1) ^3.8$ ____ (3.1) ^1.4$ ____ (3.0) ^2.4$ ____ (3.0) ^0.4$ ____ (3.0) ^3.4$ ____ (3.0) ^4.4$ ____ (3.0) ^7-6$ ____ (2.9) ^4-6$ ____ (2.9) ^3-6$ ____ (2.9) ^1.7$ ____ (2.8) ^2.7$ ____ (2.8) ^0.7$ ____ (2.8) ^3.7$ ____ (2.8) ^4.7$ ____ (2.8) ^5.7$ ____ (2.8) ^6.7$ ____ (2.8) ^)$ ____(2.7) ^1.9$ ____ (2.7) ^2.9$ ____ (2.7) ^0.9$ ____ (2.7) ^3.9$ ____ (2.7) ^6-4$ ____ (2.7) ^5-4$ ____ (2.7) ^3-4$ ____ (2.7) ^[$ ____(2.6) ^V$ ____(2.6) ^1.2$ ____ (2.5) ^0.2$ ____ (2.5) ^2.2$ ____ (2.5) ^3.2$ ____ (2.5) ^6-7$ ____ (2.5) ^1.3$ ____ (2.4)
Filter 60 (bias = -0.54) #
p
u
W
F
X
M
I
s
Q
L
2
j
q
U
O
h
a
m
w
D
<BOS>
.
1
R
P
f
i
B
^
J
"
l
7
A
\163
S
K
c
v
2
f
E
r
4
.
3
l
5
m
G
h
X
u
W
t
Z
y
V
x
6
a
w
d
)
o
7
p
9
n
J
k
8
q
Q
b
1
v
0
'
C
u
g
W
G
-
l
a
j
q
V
y
A
f
S
E
c
N
z
e
Y
d
0
w
D
"
L
x
Z
H
5
v
F
2
m
o
/
\163
\194
3
Non-zero for 13.1% of words.
^DIEG O$____ (2.0) ^OEC D$____ (2.0) ^pig s$____ (1.9) ^pig $____ (1.9) ^pil ot$____ (1.9) ^pupil s$____ (1.9) ^pil ots$____ (1.9) ^compil ed$____ (1.9) ^pipel ine$____ (1.9) ^spel l$____ (1.9) ^Capel lo$____ (1.9) ^compel ling$____ (1.9) ^Opel $____ (1.9) ^expel led$____ (1.9) ^OPEC $____ (1.8) ^EC B$____(1.7) ^Olympic $____ (1.6) ^pic k$____ (1.6) ^pic ture$____ (1.6) ^Olympic s$____ (1.6) ^pic ked$____ (1.6) ^pic tures$____ (1.6) ^typic ally$____ (1.6) ^suspic ion$____ (1.6) ^expec ted$____ (1.6) ^spec ial$____ (1.6) ^espec ially$____ (1.6) ^expec t$____ (1.6) ^FRANCISC O$____ (1.6) ^piz za$____ (1.6) ^Lopez $____ (1.5) ^Eg ypt$____(1.5) ^Eg yptian$____(1.5) ^IOC $____ (1.5) ^Wig an$____ (1.5) ^El $____(1.4) ^El izabeth$____(1.4) ^El ection$____(1.4) ^El ectric$____(1.4) ^Wil liams$____ (1.4)
Filter 61 (bias = -0.60) #
s
p
h
v
S
T
L
P
A
c
H
K
u
z
F
<BOS>
4
0
U
k
.
w
3
q
Y
X
y
J
7
B
E
t
a
D
8
o
x
-
/
m
z
H
K
F
O
u
G
h
o
m
&
e
R
d
Y
j
9
f
0
.
Q
y
S
-
5
t
3
M
J
q
8
?
/
v
%
\$
,
n
N
k
Q
g
W
v
I
j
"
o
X
-
a
D
,
m
N
c
$
J
/
T
G
u
0
M
l
h
z
d
i
L
Non-zero for 17.2% of words.
^soa ring$____ (2.0) ^soa red$____ (2.0) ^soa p$____ (2.0) ^soa r$____ (2.0) ^Ochoa $____ (1.9) ^hoa x$____ (1.9) ^also$ ____ (1.9) ^so$ ____ (1.9) ^Also$ ____ (1.9) ^Alonso$ ____ (1.9) ^Aso$ ____ (1.9) ^Picasso$ ____ (1.9) ^Barroso$ ____ (1.9) ^who$ ____ (1.8) ^Who$ ____ (1.8) ^Idaho$ ____ (1.8) ^Mourinho$ ____ (1.8) ^Cruz$ ____ (1.6) ^LON DON$____ (1.6) ^Loa n$____ (1.6) ^GAO$ ____ (1.5) ^LG$ ____ (1.5) ^AG$ ____ (1.5) ^WHO$ ____ (1.5) ^So$ ____ (1.5) ^FOX $____ (1.4) ^FOX News.com$____ (1.4) ^PARI S$____ (1.4) ^Ho$ ____ (1.4) ^duo$ ____ (1.4) ^quo$ ____ (1.4) ^sor t$____ (1.4) ^professor $____ (1.4) ^resor t$____ (1.4) ^Professor $____ (1.4) ^author ities$____ (1.3) ^shor t$____ (1.3) ^author ity$____ (1.3) ^author $____ (1.3) ^UK$ ____ (1.3)
Filter 62 (bias = -0.61) #
2
m
1
f
0
.
3
'
9
t
5
k
7
M
I
b
J
F
6
y
8
h
4
r
G
u
E
S
z
<BOS>
D
s
X
l
i
x
W
j
P
g
i
c
H
D
W
L
-
z
h
K
Y
.
4
G
k
U
:
N
I
y
V
F
;
o
3
C
1
Z
q
f
u
m
7
A
j
0
2
&
6
r
K
g
O
.
B
d
f
c
N
h
o
u
S
n
9
-
,
D
W
H
Q
v
3
j
"
t
P
C
J
m
R
A
8
q
X
T
E
y
Y
1
Non-zero for 15.0% of words.
^10-K $____ (2.3) ^2-3 $____ (1.8) ^1-3 $____ (1.8) ^14, 000$____ (1.7) ^Gif fords$____ (1.6) ^13, 000$____ (1.6) ^11, 000$____ (1.5) ^2-2 $____ (1.5) ^3-3 $____ (1.5) ^17, 000$____ (1.5) ^22, 000$____ (1.5) ^1-2 $____ (1.5) ^145 $____ (1.4) ^12, 000$____ (1.4) ^16, 000$____ (1.4) ^1949 $____ (1.4) ^24$ ____ (1.4) ^5-3 $____ (1.4) ^14$ ____ (1.3) ^2014$ ____ (1.3) ^1943 $____ (1.3) ^WHO $____ (1.3) ^135 $____ (1.3) ^147 $____ (1.3) ^6-3 $____ (1.3) ^25, 000$____ (1.3) ^dif ferent$____ (1.3) ^dif ficult$____ (1.3) ^dif ference$____ (1.3) ^dif ferences$____ (1.3) ^dif ficulties$____ (1.3) ^Cardif f$____ (1.3) ^dif ficulty$____ (1.3) ^modif ied$____ (1.3) ^BEIJIN G$____ (1.2) ^199 9$____ (1.2) ^199 0s$____ (1.2) ^199 7$____ (1.2) ^199 8$____ (1.2) ^1939 $____ (1.2)
Filter 63 (bias = -0.46) #
J
t
Y
y
L
p
R
d
V
q
z
e
9
w
B
H
G
\163
b
I
<EOS>
-
<BOS>
W
l
.
K
i
Z
c
U
g
0
h
5
T
\195
r
8
a
K
F
O
.
W
v
G
d
3
u
Y
f
1
j
w
b
i
-
2
x
o
e
5
t
X
l
Z
h
&
q
4
r
/
\$
,
m
z
D
'
Y
f
H
K
m
o
g
J
h
x
Q
P
$
z
V
p
.
E
M
B
"
9
'
0
X
5
\194
2
W
s
T
a
N
e
,
F
Non-zero for 16.1% of words.
^YORK$ ____ (1.9) ^TOKYO$ ____ (1.9) ^Jim $____ (1.8) ^Jim my$____ (1.8) ^LG$ ____ (1.7) ^VW$ ____ (1.7) ^HBO$ ____ (1.6) ^CHICAGO$ ____ (1.6) ^DIEGO$ ____ (1.6) ^Lim ited$____ (1.6) ^Lim baugh$____ (1.6) ^PKK$ ____ (1.6) ^Joh n$____ (1.6) ^Joh nson$____ (1.6) ^Joh nny$____ (1.6) ^Joh nston$____ (1.6) ^Rom ney$____ (1.5) ^Rom an$____ (1.5) ^Rom e$____ (1.5) ^Rom ania$____ (1.5) ^Rom a$____ (1.5) ^Rom o$____ (1.5) ^Rom anian$____ (1.5) ^Rom ero$____ (1.5) ^Jo$ ____ (1.5) ^UK$ ____ (1.5) ^Lig ht$____ (1.5) ^Rig hts$____ (1.4) ^Rig ht$____ (1.4) ^Rih anna$____ (1.4) ^Log an$____ (1.4) ^Loh an$____ (1.4) ^Rog er$____ (1.4) ^Rog ers$____ (1.4) ^Li$ ____ (1.4) ^1993$ ____ (1.4) ^93$ ____ (1.4) ^Vog ue$____ (1.3) ^Gom ez$____ (1.3) ^bom b$____ (1.3)