Rantburg

2013-01-11 Government
What Words and Phrases in Your Emails Might the FBI's New Software Be Searching For?
Archived material is restricted to Rantburg regulars and members. If you need access email fred.pruitt=at=gmail.com with your nick to be added to the members list. There is no charge to join Rantburg as a member.
Posted by Uncle Phester 2013-01-11 00:00 [282 views]

#1 Ummm, Bet "Fuck You FBI" is high on the list.
So, now the FBI will waste time on me?
Posted by Redneck Jim 2013-01-11 00:46

#2 I suspect that:

"A well regulated Militia, being necessary to the security of a free State, the right of the people to keep and bear Arms, shall not be infringed."

will probably flag a bit of attention as well.
Posted by abu do you love 2013-01-11 01:42

#3 It's called "tagging". Palantir Technolgies. The FBI is a client user.

Not only can it identify specific words or groups of words in massive volumes of data, by hitting the "merge" command, you can link everyone else who uses the term with a line and graphically depict the entire network of users.
Posted by Besoeker 2013-01-11 02:21

#4 Palantir Technologies

(link fell off on #3, spelling was even worse)
Posted by Besoeker 2013-01-11 02:23

#5 It's the phrases it won't be searching for, but should (and won't be added because of PC) that concern me the most.
Posted by Bright Pebbles 2013-01-11 05:03

#6 Another 'popular' analysis methodology that may interest curious Rantburgers is Linguistic Inquiry and Word Count.
Posted by Skidmark 2013-01-11 05:35

#7 Do they search messages written in Arabic?
Posted by Rambler in Virginia 2013-01-11 09:23

#8 1 w0nd3r h0w th3y h4nd13 l33t sp34k?
Posted by Bright Pebbles 2013-01-11 10:44

#9 Do they search messages written in Arabic?

Of course not silly - that would be RACIST.

Besides we all know the real terrorists are you 'white crackers' and returning veterans - and the Tea Party of course.

/SARC

As I recall the old 'emacs' text editor used to have a 'spook' command which would insert one of the terms the CIA was supposed to be monitoring emails for...
Posted by CrazyFool 2013-01-11 11:39

#10 "Allahu Akbar" will not be tagged.

"Deus Vult" will be tagged.
Posted by charger 2013-01-11 12:44

#11 In the Chiefs secondary nobody will cover up, and that the payroll vs talent return is off the books horrible. I just hope nobody will find out, especially thoz Ray-dahs.

That the Chiefs are going to be horrible for two more years is supposed to be a secret, jeesh!

What about Hot Carl, or is that Secret Service territory?
Posted by swksvolFF 2013-01-11 14:35

#12 Mucky is the ultimate encrypter. His messages will give the software migraines.
Posted by Alaska Paul 2013-01-11 16:56

#13 Does the software understand Bavarian, too?
Posted by European Conservative 2013-01-11 17:23

#14 How does the software handle strikethroughs?
Posted by Barbara 2013-01-11 18:05

#15 Does anyone have eyes on all that uranium in Niger and Argentina, or is it better for the poor people to have mining jobs, kinda like farmers growing poppy in Afghanistan?
Posted by Maggie Flomong2662 2013-01-11 18:16

#16 They can't find who is supporting al Qaeda. Who cares?
Posted by Maggie Flomong2662 2013-01-11 18:17

#17 This is an area in which I have some expertise. To answer the questions above:

There are many ways (and many off-the-shelf software packages, and many custom systems) that handle natural language data, including in text form (documents, web pages, tweets, email etc. as opposed to transcribed conversations, which tend to have a different linguistic structure). The state of the art goes well beyond just finding specific words or phrases, but the capabilities of specific systems outside of R&D shops differ greatly. IARPA is already several years into an R&D program on cross-language, cross-culture metaphor identification and interpretation, for instance. NIST has been running text retrieval, topic modeling, content extraction, machine translation etc. challenges / competitions for almost 20 years now. DARPA has a Deep Exploration and Filtering of Text effort that is expected to transition to operational use within DOD within a few years.

l33t sp34k would be fairly easy to deal with. Deleted text is just the regular text surrounded by formatting markers, so no problem there. Tweets / text conventions are already addressed here and there. Many packages handle various languages, including Arabic. Palantir primarily displays and links rather than interpreting the text itself - document information is imported into the tool, but tagging is done manually by analysts before the tool can display and cross-correlate based on those tags.
Posted by lotp 2013-01-11 18:37
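
To illustrate the two points in #17 about l33t sp34k and struck-through text, here is a minimal Python sketch. It is not how any FBI or vendor system works. The substitution table, the marker styles (`<s>`, `<strike>`, `<del>`, `~~`), and all function names are assumptions made up for this example.

```python
import re

# Assumed digit-to-letter table; a real system would need a far larger mapping.
LEET_MAP = str.maketrans({"0": "o", "1": "l", "3": "e", "4": "a", "5": "s", "7": "t"})

# Assumed strikethrough markers: HTML <s>/<strike>/<del> tags and ~~...~~ pairs.
STRIKE_TAGS = re.compile(r"</?(?:s|strike|del)>", re.IGNORECASE)


def strip_strike_markers(text: str) -> str:
    """Drop the strikethrough markers but keep the struck text itself."""
    return STRIKE_TAGS.sub("", text).replace("~~", "")


def normalize_leet(text: str) -> str:
    """Lowercase the text and map digits back to letters,
    but only inside tokens that mix digits and letters."""
    def fix(match: re.Match) -> str:
        word = match.group(0)
        has_digit = any(c.isdigit() for c in word)
        has_alpha = any(c.isalpha() for c in word)
        return word.translate(LEET_MAP) if has_digit and has_alpha else word

    return re.sub(r"\w+", fix, text.lower())


def normalize(text: str) -> str:
    """Strip formatting markers first, then undo leet substitutions."""
    return normalize_leet(strip_strike_markers(text))


print(normalize("1 w0nd3r h0w th3y h4nd13 l33t sp34k?"))
print(normalize("the <s>old</s> plan"))
```

Tokens made only of digits (dates, phone numbers, the lone "1" standing in for "I") are left alone in this sketch, which is a deliberate simplification.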

#18 If you type text in a spreadsheet and then cut and paste the text as a 'picture' into your email, will the software be able to read it?

Note: there is a function called spedis that converts like-sounding terms, so intentional misspellings won't matter.
Posted by Airandee 2013-01-11 18:50
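
The misspelling point in #18 can be sketched with plain Levenshtein edit distance. This is a swapped-in technique chosen for illustration, not a reimplementation of spedis. The function names, the watchword list, and the `max_distance` threshold are all hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum single-character inserts,
    deletes, or substitutions needed to turn a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete from a
                            curr[j - 1] + 1,      # insert into a
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def matches_watchword(token: str, watchwords: list[str], max_distance: int = 1) -> list[str]:
    """Return the watchwords within max_distance edits of token (case-insensitive)."""
    token = token.lower()
    return [w for w in watchwords if edit_distance(token, w.lower()) <= max_distance]


# Hypothetical watchword list, for illustration only.
print(matches_watchword("Technolgies", ["technologies", "palantir"]))
```

A small threshold catches dropped or swapped letters while keeping unrelated words apart; raising it trades more catches for more false matches.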

#19 Neither of those presents insuperable technical problems, airandee.
Posted by lotp 2013-01-11 21:56

