Sunday, August 29, 2010

5 indispensable IT skills of the future

From: http://www.computerworld.com/s/article/350908/5_Indispensable_IT_Skills_of_the_Future

Computerworld -
 In the year 2020, technical expertise will no longer be the sole province of the IT department. Employees throughout the organization will understand how to use technology to do their jobs.

Yet futurists and IT experts say that the most sought-after IT-related skills will be those that involve the ability to mine overwhelming amounts of data, protect systems from security threats, manage the risks of growing complexity in new systems, and communicate how technology can increase productivity.

1. Analyzing Data

By 2020, the amount of data generated each year will reach 35 zettabytes, or about 35 million petabytes, according to market researcher IDC. That's enough data to fill a stack of DVDs reaching from the Earth to the moon and back, according to John Gantz, chief research officer at IDC.

Demand will be high for IT workers with the ability to not only analyze dizzying amounts of data, but also work with business units to define what data is needed and where to get it.

These hybrid business-technology employees will have IT expertise and an understanding of business processes and operations. "They are people who understand what information people need" and how that information translates into profitability, says David Foote, president and CEO of IT workforce research firm Foote Partners LLC. "You'll have many more people understanding the whole data 'supply chain,' from information to money," he says.

2. Understanding Risk

Risk management skills will remain in high demand through 2020, says futurist David Pearce Snyder, especially at a time when business wrestles with growing IT complexity. Think of IT problems on the scale of BP's efforts to stop the Gulf of Mexico oil spill, or Toyota's work to correct sudden acceleration in some of its cars, Snyder says.

"When you're in the time of rapid innovation," which is happening now and will continue into 2020, he contends, "you run into the law of unintended consequences -- when you try something brand-new in a complex world, you can be certain that it's going to produce unexpected consequences." Businesses will seek out IT workers with risk management skills to predict and react to these challenges

3. Mastering Robotics

Robots will have taken over more jobs by 2020, according to Joseph Coates, a consulting futurist in Washington. IT workers specializing in robotics will see job opportunities in all markets, he adds.

"You can think of [robots] as humanlike devices, but you have to widen that to talk about anything that is automated," Coates says. Robotics jobs will involve research, maintenance and repair. Specialists will explore uses for the technology in vertical markets. For example, some roboticists might specialize in health care, developing equipment for use in rehabilitation facilities, while others might create devices for the handicapped or learning tools for children.

4. Securing Information

Since we're spending more and more time online, verifying users' identities and protecting privacy will be big challenges by 2020, because fewer interactions will be face-to-face, more personal information may be available online, and new technologies could make it easier to impersonate people, according to a report by PricewaterhouseCoopers. Teleworkers will also represent a larger portion of the workforce, opening up a slew of corporate security risks.

"We're in a dangerous place," because many employees are tech-savvy, yet they "don't understand the first thing about data security," Foote explains. "That will change in 2020," when companies will cast an even wider net over data security -- including the data center, Internet connectivity and remote access, he predicts.

5. Running the Network

Network systems and data communications management will remain a top priority in 2020, but as companies steer away from adding to the payroll, they will turn to consultants to tell them how to be more productive and efficient, says Snyder, who follows predictions from the U.S. Bureau of Labor Statistics.

"You have already cut as many people as you can, so now you can only increase productivity," he says. "Someone has to come in here and tell me how to better use the technology that I have."

Posted via email from Troy's posterous

Friday, August 27, 2010

[Tools] matplotlib

http://matplotlib.sourceforge.net/index.html

Plotting package for python.

intro

matplotlib is a python 2D plotting library which produces publication quality figures in a variety of hardcopy formats and interactive environments across platforms. matplotlib can be used in python scripts, the python and ipython shell (ala MATLAB®* or Mathematica®), web application servers, and six graphical user interface toolkits.

Posted via email from Troy's posterous

Monday, August 16, 2010

[Linux] Graph format conversion commands

1. convert

This command is really powerful, it could deal many kinds of image formats, including different vector graphics. 
With "-trim" to only convert the drawing area without white margins.

2. epstopdf

This command coverts eps files to pdf file in vector form.

Posted via email from Troy's posterous

Wednesday, August 11, 2010

[Statistics] Point Estimation

Download now or preview on posterous
6103_chap01.pdf (399 KB)

From Wiki:

In statistics, point estimation involves the use of sample data to calculate a single value (known as a statistic) which is to serve as a "best guess" for an unknown (fixed or random) population parameter.
More formally, it is the application of a point estimator to the data.
In general, point estimation should be contrasted with interval estimation.
Point estimation should be contrasted with general Bayesian methods of estimation, where the goal is usually to compute (perhaps to an approximation) the posterior distributions of parameters and other quantities of interest. The contrast here is between estimating a single point (point estimation), versus estimating a weighted set of points (a probability density function). However, where appropriate, Bayesian methodology can include the calculation of point estimates, either as the expectation or median of the posterior distribution or as the mode of this distribution.
In a purely frequentist context (as opposed to Bayesian), point estimation should be contrasted with the specific interval estimation calculation of confidence intervals.

From other resources:

For a population whose distribution is known but depends on one or more unknown parameters, point estimation predicts the value of the unknown parameter and interval estimation determines the range of the unknown parameter.

In summarization, point estimation is used to estimate mean, variance, standard deviation, or any other statistical parameter for describing the data.

In time-series prediction, point estimation is used to predict one or more values appearing later in the sequence by calculating parameters for a sample.

Methods to obtain point estimates:
1) moments
2) maximum likelihood estimation
3) Bayes estimators
4) EM
5) robust estimation
...

Criteria to assess estimators:
1) bias
2) mean squared error
3) standard error
4) efficiency 
5) consistency
...

 

Posted via email from Troy's posterous

Thursday, August 5, 2010

[ASR] A Study on Lattice Rescoring with Knowledge Scores for Automatic Speech Recognition

Download now or preview on posterous
isca_rescoring.pdf (181 KB)

15 HMM based articulatory detectors are adopted to generate log-likelihood rate features for a later stage NN to predict phone posteriors.

Meanwhile, those LLRs are directly used to rescoring the lattice generated from the standard HMM ASR systems and has been shown to yield better performance.

The articulatory knowledge scores are generated by those HMM based detectors, which are better than NN based detectors.
The problem with the NN-based scores is that they are likely to fluctuate.

Automatic Speech Attribute Transcription (ASAT) paradigm.

Frame level LLRs are better than segmental level's.

The 15 articulators adopted in this paper are: fricative, vowel, stop, nasal, approximant, low, mid, high, labial, coronal, dental, velar, retroflex, glottal, and silence.

Posted via email from Troy's posterous

[LVCSR] A phonetic feature based lattice rescoring approach to LVCSR

In the previous work, the authors developed a detector based high performance phone recognizer. Articulatory informations are extracted using a bunch of speech feature detectors implemented by MLP. A final event merger, another MLP, combines those different detectors to predict phoneme posteriors, which are used as HMM's emission probabilities for decoding.

In this paper, the detector based phoneme recognizer is extended to LVCSR. With the state-of-the-art HMM based speech recognizer, word lattices are generated. The the high quality monophone posteriors generated by the detector based recognizer is utilized to rescore the lattices for second stage decoding.

Comparing with standard MLE and MMI trained HMM systems, the rescored lattices yield lower WER on WSJ0 corpus.

The system structure is illustrated in the figure below:


Download now or preview on posterous
4960471.pdf (286 KB)

Posted via email from Troy's posterous

Google+